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PREFACE 


© 


analysis is nowadays necessary to the study and practise of 

educational, vocational, or other branches of applied psycko- 
logy. The mathematical techniques of factorization ase less 
essential for those who are not actually going to carry out analyses, 
and they are omitted in this book. Even without them the topic 
is a difficult one, but I have tried to present it simply and to Show 
that it does have extremely important practical bearings. I assume 
only that the reader has had an elementary course in psychology 
and knows what an intelligence test and a correlation coefficient 
are. One section of the book which is inevitably more technical 
than the rest is relegated to an appendix. 

My other chief aim is to bring together the large number of 
publications in this field, in Britain and America, which at first 
sight appear to give contradictory and confusing accounts of 
mental structure, and to show that they can be fitted into one 
consistent—even if incomplete—picture. This involves the re- 
working or reinterpretation of the results of many authors. Almost 
all the contributions from about 1935 to the middle of 1949 are 
critically surveyed (though less attention is paid to earlier litera- 
ture), and several unpublished analyses carried out by British 
psychologists in the Services are described. The book does not, 
however, attempt to cover studies of personality factors, attitudes 
and interests, or other fields outside that of abilities. 

Too many colleagues to mention by name have encduraged me 
to write this book, or have made useful comments on earlier drafts. 
I am indebted to them, also to my wife for her assistance in pre- 
paring the manuscript and index. Acknowledgment is due to the 
National Institute of Industrial Psychology for permission to re- 
print several sections of an article (Vernon, 1949a). It may be 
noted here that dates are inserted after authors’ names as refer- 


ences to the bibliography. P.E. V 


So acquaintance with the principles and results of factor 


CONTENTS 


Chapter Page 
PREFACE vii 
I. MENTAL FACULTIES AND FACTORS 41 


II. LANDMARKS IN THE DEVELOPMENT OF FACTOR ANALYSIS 11 


9 
III. HIERARCHICAL GROUP-FACTOR THEORY OF THE STRUC- 


TURE OF ABILITIES ; 25 
IV. ANALYSES OF EDUCATIONAL ATTAINMENTS 37 
V. INTELLECTUAL FACULTIES 49 
VI. VERBAL AND NON-VERBAL FACTORS IN INTELLIGENCE 
TESTS 64 
VII. PRACTICE, DIFFICULTY, SPEED, AND OTHER FACTORS 76 
° 
VIII. SENSATION, PERCEPTION, IMAGERY, AND AESTHETIC 
ABILITIES 87 
IX. PSYCHOMOTOR AND PHYSICAL ABILITIES 96 
X. PERFORMANCE TESTS AND MECHANICAL (utes z 107 
XI. OCCUPATIONAL ABILITIES a A 121 
APPENDIX: GENERAL-}GROUP FACTOR @S. MULTIPLE 
FACTOR THEORIES 129 
BIBLIOGRAPHY r. 136 
153 


INDEX 


FIGURES 
. Two-Factor, Group-Factor and Multiple-Factor Analyses 


. Diagram Illustrating Hierarchical Structure of Human 
Abilities 


. Diagram of the Structure of Educational Abilities 


Diagram of Intellectual and Practical Factors in Psycho- 
logical Tests 


. Diagram of Sensory, Perceptual, Imagery, and Aesthetic 
Discrimination Factors 


. Graphs of Factor Loadings in 15-year and 18-year 
Artificer Apprentices 


. Diagram Illustrating the Structure of Occupational 
Abilities 


Page 
18 


22 


47 


85 


94. 


117 


127 


TABLES 


e 


Table Page 

I. Correlation Coefficients Between Six Psychological 
Tests 5 
II. G-Loadings of the Six Tests and their Products * = 6 

III. Residual Correlations after Subtracting the Over- . 

lapping Attributable to G 6 

IV. Completed Factor Analysis of Six Psychological 
Tests . ET 

V. Simple Summation and Group-Factor Analyses of 
Tests given to 1,000 Army Recruits 23 

VI. Averaged Correlations Between Different Types of 
Reading and Intelligence Tests (Gates, 1921) 45 

VII. Analysis of 17 Tests among 645 R.A.F. Ground 
Recruits 72 

VIII. Rotated Centroid Factors for Tests and Examina- 

tions taken by° 540 Candidates for the: Higher 
Civil Service 3 74 

IX. Group-Factor Analysis of Mechanical and Other 
Tests among 500 Ordinary Seamen 100 

X. Selected Correlations from the Minnesota Investiga- 
tion of Mechanical Ability 102 

XI. Correlations between Mechanical and Physical 
Tests after Extraction of G and V: ED 105 
XII. Factorization of Tests given to African Recruits 106 

XIII. Earle and Milner’s Correlations between Different 
Types of Tests 109 

XIV. Alternative Groupings of Thirteen Tests Analysed 
among 283 R.A.F. Fitters 113 

XV. Centroid and Group Factors among Mechanical 
Tests Applied to Army Driver Mechanics 115 


XVI. Rotated Centroid Factors among Tests Applied to 
312 Naval Air Mechanics 

XVII. Group-Factor Analyses among 200 Representative 

A.T.S. Recruits and 200 Special Operators 

XVIII. Centroid Factors in the Course Marks of Engine 
‘ Room Mechanic Trainees 

XIX. Centroid Factors in Workshop Proficiency Measures 

. among 122 Naval Electrical Mechanics 


117 


119 


124 


125 


CHAPTER I è 
MENTAL FACULTIES AND FACTORS 


Abstract. This chapter criticizes the ascription of human 
abilities and behaviour to hypothetical faculties, traits or powers 
of the mind, as exemplified in phrenology and in various educa- 
tional and other theories. An ability or factor should be thought 
of as a class or group of performances, and it should be admitted 
only if a number of measurements in this class (e.g. test results) 
overlap or correlate positively with one another. The basic màthe- 
matical methods of factor analysis are quite straightforward. By 
means of them we can find what tests, examinations, etc., really 
measure, i.e. their factor content. The terms g, group factor, 
communality, specificity and variance are explained. Some of the 
limitations of factor analysis and its relations to other psycho- 
logical methods are pointed out. 

Faculties and Phrenology. The faculties or powers of the 
human mind have been for centuries a matter of interest, not only 
to the ordinary man who wishes to explain his own conduct and 
that of other people, but also to the philosopher, psychologist and 
educationist. Until recent years, however, their nature and 
numbers were matters of pure speculation. Casual observation 
and introspection are incapable of providing scientific proof of 
their existence, and in consequence many past theories of human 
abilities and qualities and their organization were entirely fal- 
lacious, One of the most popular doctrines of the early“nineteenth 
century was the phrenology of Gall, Spurzheim and Coombe, 
which assumed that the strength of each faculty was indicated by 
the prominence of bumps on the appropriate parts of the skull. 
Thirty-seven faculties were recognized including Propensities 
(Amativeness, Combativeness, etc.), Sentiments (Self-esteem, 
Conscientiousness), Perceptive (Size, Colour, Number, Tune, 
Language), and Reflective (Comparison and Causality). We now 
know that traits and abilities are not located in particular parts of 
the brain, and that the growth of areas of the brain does not 
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produce protuberances on the skull. Indeed it has been said that, 
‘The bumps on a man’s skull tell you more about his wife's 
character than his own.’ But the criticism which mainly concerns 
us is that Gall provided no evidence that these faculties are the 
fundamental ones, nor that they are independent or self-con- 
sistent. Is the sense of size really distinct from that of form; may 
not both depend on some more basic capacity? Is the person good 
at reading and-spelling the English language necessarily good at 
languages in general?. How about memory, reasoning, mechanical 
sense; 2nd a host of other qualities absent from the list? 

Faculties in Educational Theory. Unscientific faculty 
psychology permeates educational theory and practice even at the 
present day. School subjects or new methods of teaching are 
introduced because they are alleged to develop such and such a 
faculty. For example, Nature Study stimulates powers of observa- 
tion, learning poetry develops the memory, and so on. A Board of 
Education circular in the 1930s justified physical training in schools 
not merely for its effects on the health of pupils, but because it 
enhances mental alertness, self-control and the team spirit. At 
one time it was thought that the reasoning faculty is only rudi- 
mentary in children up till the age of twelve, whereas memory is 
comparatively strong, hence primary schooling should concen- 
trate on drill subjects. Scientific research has largely contradicted 
these assumptions. Children of three years and younger often 
reason out the solutions to problems that interest them; the 
capacity for learning new material increases with age and is much 
superior in the young adult to that of the seven-year child, as 
Thorndike has shown. This capacity, moreover, cannot be im- 
proved all-round simply by practising the memorization of poetry, 
multiplication tables or spellings. And an experiment by Sutcliffe 
and Canham (1937) disproved the view that extra physical training 
in school hours is of sufficiert benefit to the mind to compensate 
for the loss of time from intellectual work. 

Faculties in Popular Thought and in Vocational Psycho- 
logy. Equally dubious is the view so often put forward by parents 
that their boy who is backward at school work makes up for it by 
being good with his hands, or the assumption that the quick 
worker is usually inaccurate. We tend to divide people up into 
types—the practical, the academic, the leader, the aesthete, etc., 
forgetting that most individuals are good at some practical things, 
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not at others. The boy who would make a good carpenter would 
not necessarily succeed as a plumber, nor a machinist as a civil 
engineer. The influence of this type theory manifests itself when 
employers distrust the psychologist’s paper-and-pencil spatial and 
mechanical tests for selecting tradesmen. Such tests, they think, 
are too ‘theoretical’ to pick the man with the practical ‘flair’ or to 
eliminate the ‘ham-handed’. The whole field of occupational 
selection and guidance has indeed been befogged by. unverified 
speculations about qualities underlying jobs. It is not only the 
layman who analyses a job as requiring ‘dexterity, alertness and 
concentration’. Psychologists have carried out more systematic 
studies of jobs, but many, particularly in Germany, have been 
equally guilty of assuming that names like these represent distinc- 
tive and consistent abilities (cf. Vernon and Parry, 1949). 
Definitions of Intelligence. It is psychologists again who, 
although they have been testing intelligence with some success for 
over forty years, have failed to reach any agreed definition as to 
what it is they are measuring. Binet frankly regarded it as a col- 
lection of faculties: ‘judgment, practical sense, initiative... 
adapting oneself to circumstances’. His scale, however, was com- 
posed of tests which would differentiate older from younger 
children, The only criterion that they were measuring judgment, 
etc., was his own opinion. Several psychologists have considered 
intelligence as the ability to profit from experience, which con- 
trasts with the mechanical, instinctive reactions of lower species. 
None of the commonly used tests appear to involve any such 
quality. In a famous symposium published in 1921, thirteen 
psychologists gave thirteen different views. Terman stressed 
capacity for abstract thinking, Dearborn capacity to learn, Colvin 
adjustment to environment, and so on. Actually there is much 
overlapping between such views, but further theoretical discussion 
will not get us anywhere. It will not.tell us just how much they 
have in common, nor what is the real essence of intelligence.* 
The Empirical Approach to Human Abilities, Based on 
Correlation. Psychologists nowadays tend to adopt a more 
operational or Behaviouristic outlook, though rejecting the wilder 
excesses of J. B. Watson’s doctrines. They realize the fruitlessness 
of mental entities such as faculties, which can never be directly 


1A useful summary of these and other definitions, and an explanation of 
Spearman’s two-factor theory, are given by Knight (1933). 
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observed nor verified, and prefer to deal with concepts directly 
derived from measurable activities of human beings. An ability is 
inferred from the fact that some people carry out certain tasks 
more rapidly or more correctly than others. Whether it also de- 
pends on some power in the mind is a matter which interests meta- 
physicians but not scientists. The clue leading to a scientific 
solution of the impasse is provided by the term overlapping, i.e. by 
correlation.. By means of correlation we can find whether the scores 
of a group of people on two or more tasks correspond or not, and 
therefore whether these tasks involve the same, or distinctive, 
abilities. If several tests presumed to measure a particular ability 
do not correlate positively with one another, that ability cannot be 
accepted as a useful conception. Take memory as an example. 
We all know that a schoolboy may have an excellent memory for 
cricket scores or names of motor-cars, and a poor memory for 
school work, and that a professor who remembers everything about 
his own subject may be absent-minded in daily life, or forgetful 
of names and faces. If these various kinds of memory are measured 
and inter-correlated and little or no agreement is found, it is ob- 
vious that there is no one general faculty of memory, but a lot of 
specific varieties. We need not demand that such tests correlate 
perfectly; they may show a limited amount of overlapping, and 
some may correlate more highly with the rest than others do. But 
only in so far as they do correlate can they be regarded as measur- 
ing a memory ability or factor. “Otherwise each test is merely 
measuring the ability specific to that test and to no other. It follows 
too that any test can be regarded as divisible into two portions 
which we call its communality and its specificity, i.e. what it has in 
common with other tests and what is specific to it alone. 

There is yet another possibility. Positive correlations between 
several tests designed to measure memory might arise if the tests 
were in fact all measuring some other, more fundamental, ability, 
say intelligence. Factorial technique enables us to examine this, 
and to discover whether or not there is overlapping over and 
above anything attributable to intelligence. We thus arrive at the 
definition of an ability given by the writer elsewhere (Vernon, 
1940): “It implies the existence of a group or category of per- 
formances which correlate highly with one another, and which 


are relatively distinct from (i.e. give low correlations with) other 
performances.’ 
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Illustration of the Mathematical Fundamentals of Factor 
Analysis. It is unfortunate that this approach to the analysis of 
abilities involves somewhat complicated mathematics, since this 
frightens or antagonizes many of the teachers, employers, and 
others who are most prone to discuss abilities unscientifically. Yet 
the basic principles are very simple, as the following hypothetical 


examples will show. 


TABLE I.—CORRELATION COEFF ICIENTS BETWEEN SL 
PSYCHOLOGICAL TESTS bk? 


Tests 
. Vocabulary 4°76 +°79 +°45 + “410 +34 
2. Analogies +76 +68 +44 +35 +26 
3. Classifications +79 +68 r 4:49 +°39 + -32 
4. Block Design +45 HA +49 +58 +:44 
5. Spatial +41 +35 +:39 +°58 +°55 
6. Formboard +34 +26 +°32 HH 


that might be obtained between 
six tests applied to a large group of children (Block Design and 
Formboard being given individually). Inspection suggests that 
the correlations between the first three and last three are relatively 


bal tests is partially distinct from ability 


small, i.e. that ability at ver! ly disti 
at practical or spatial tests. But the separation 1s incomplete. All 
the correlations are positive, showing that all tests have something 


in common, presumably of the nature of general intelligence. By 
the appropriate techniques we can find how far each test measures 
this general ability or factor which we shall call g, and Table II 
lists the loadings, saturations or correlations with g. Now if this 
was the only underlying ability, we could reproduce the test inter- 
correlations simply by taking the products of their g-loadings. 


For example: 


Table I gives the correlations 


rgg5—TggXTgg ='8X:5= -40 
Such products are listed in Table IT, and in Table III each pro- 
duct has been subtracted from the corresponding original corre- 
lation to show what overlapping, if any, remains. These are known 
as residual correlations. 
B 
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TABLE II.G-LOADINGS OF THE SIX TESTS AND THEIR 
PRODUCTS 


Products 
3 4 


. Vocabulary : s "64 -48 


. Analogies Š r "56 42 


. Classifications : 64 - -48 
. Plock Design 


. Spatial 


. Formboard 


TABEE III.—RESIDUAL CORRELATIONS AFTER SUBTRACTING 
THE OVERLAPPING ATTRIBUTABLE TO G 


2 3 


. Vocabulary +:20 +15 
. Analogies +:20 ++12 
. Classifications +:15 +12 


. Block Design —03 +:02 +:01 
. Spatial +01 +:00 —-01 
. Formboard +-:02 —-02 +-:00 


The residuals between the first three and last three tests are not 
all zero, but are so close to it that they can reasonably be attributed 
to chance errors in the original correlations. Within each group of 
three however the residuals are large, showing that distinct verbal 
and practical-spatial abilities. are present. Each set can be analysed 
separately, and if the following loadings are multiplied out, they 
exactly reproduce the residual correlations: 


Verbal-factor Spatial factor 
loading loading 
1. Vocabulary 5 4. Block Design 4 
2. Analogies 4 5. Spatial HI 


3. Classifications -3 6. Formboard -5 
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Subsidiary abilities, over and above g, are called group factors, 
since they run through a limited group of tests. It is preferable to 
name them by symbols, such as v for verbal, k for spatial, rather 
than giving them ability-names which may readily be misinter- 
preted. Similarly we use g to refer to the objectively established 
general factor, instead of the subjective and indefinable term 
intelligence. 

The communality of any test, i.e. its total factor-content, is 
shown by the squares of its factor loadings. Table IV lists these 
loadings, their squares, the communality (hè), and what is Jef@over 
from 1.0, i.e. the specificities. Thus we can state that the Voca- 
bulary test measures 64 per cent. g, 25 per cent. v, and the remain- 
ing 11 per cent. is specific. The Formboard is a much poorer g 


e 
TABLE IV.—COMPLETED FACTOR ANALYSIS OF SIX PSYCHO- 
LOGICAL TESTS 


Squares of 
Loadings 
ce Ve 


. Vocabulary 3 3 "64 25 


. Analogies bye Y *49 °16 


. Classifications | *8 * -64 
. Block Design | - 9 "36 
. Spatial Ġ à *25 


. Formboard $ z -16 
42:3 8:3 15-0 


test, only 16 per cent. of what it measures being attributable to the 
general factor, 25 per cent. to k, and 59 peracent. specific. Such 
figures are known as the variances of the factors, and the average 
variance of each factor is given in the bottom row. These figures 
represent the importance or size of the factors in this battery of six 


tests. 


1 Many factorists further subdivide this term into the unreliability or error- 
variance of the test, and its true specificity. For example if the reliability co- 
efficient of the Vocabulary test is shown to be -94, then the error variance is "06 
and the variance of the factor specific to the testalone ‘05 (cf. textbooks of psycho- 


logical statistics such as Guilford, 1936). 
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Factors Are Not Mental Elements Like Faculties. From 
this example it should be clear that a factor is a construct which 
accounts for the objectively determined correlations between tests, 
in contrast to a faculty which is a hypothetical mental power. We 
can if we wish go on to theorize about the psychological nature and 
origin of factors. Better, we can conduct experiments to discover 
just what performances involve a factor, among which groups of 
people it emerges, and what conditions affect it. But factors 
should be regarded primarily as categories for classifying mental 
or behavioural performances, rather than as entities in the mind 
or nervous system. Since by means of factor analysis we can 
reduce a large battery of tests to a few underlying factors there is a 
certain parallel to the analysis of chemical compounds into their 
constituent elements. But this analogy should not be pressed too 
far, for we shall see later that factors are much too fluid, too de- 
pendent on the particular groups and particular tests studied, to 
be compared with elements. For example we might expect, and 
will indeed find, that the factors in scholastic abilities are depend- 
ent on how school subjects are taught. Some teachers emphasize 
the connections between the various branches of mathematics, or 
between a country’s language and its literature and history, much 
more than others do, and this is likely to be reflected in the cor- 
relations and factors. 

Identification of Factors. How factors should be identified 
and named is a somewhat controversial point. According to Guil- 
ford (1940) the factorist studies the common material, formal and 
functional features in the tests which are loaded with a factor and 
from this deduces its nature. Most factors are defined by material 
(e.g. verbal, mechanical information, etc.). The form of the test— 
whether apparatus or paper-and-pencil, choice-response or crea- 
tive-response—has not yet been proven to have much influence. 
Functional factors involve-consideration of the testees’ mental 
processes, by means of introspections or job analysis procedures or 
both (e.g. reasoning, attention, etc,). Bentley (1948) and others 
have criticized the looseness of factorists’ terminology, and the 
subjectivity of their guesses about the nature of some of their 
factors. We agree with him that it is better to avoid names of 
hypothetical functions or faculties, but would claim that the old- 
fashioned procedure“ (still common among some vocational 
psychologists, psychiatrists, teachers and others) of assuming 
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that a faculty exists and that certain tests measure it, is very much 
more subjective. Factorists do not, in fact, rely on hunches but 
always try to provide objective confirmation of a factor by carrying 
out further analyses with other populations and with enlarged 
batteries of tests, with a view to defining its content and extent 
more accurately. S 

Some Limitations of Factor Analysis. The mistake should 
not be made of identifying the whole of the psychology of abilities 
with factor analysis. Vocational and educational selection and 
guidance must take account not only of personality traits> and 
interests which might profitably be expressed as factors ‘also, but 
also of relevant experience, home circumstances and the like. And 
although there is a strong case for substituting objective tests for 
the subjective judgments of an interviewer, in practice it is seldom 
possible to carry out such guidance without an interviewer to bring 
together all the data and to interpret them to the candidates (cf. 
Vernon and Parry, 1949). Still more important for the develop- 
ment of psychological science are experiments on conditions 
affecting the performance of skills and of mental tasks, for example, 
investigations of the design of equipment, or studies of the learning 
process, of concept formation, of physical or mental fatigue and 
boredom, and so forth. Here factor analysis is largely irrelevant, 
since it deals only with the end products of human thinking and 
behaviour, and throws little light on how these products come 
about in individual human beings. Factors are indeed a kind of 
blurred average, for though they derive from the common features 
displayed by a large group of people, they may stem from very 
diverse mental and physical processes in different people. Analysis 
does not even usually tell us which factors an individual uses in 
any given performance, though it probably could do so. Thus one 
individual may score well at a test through high g, another might 
get the same score by virtue of some group factor, yet another 
through specific ability at that particular test. 

The real need for factors arises as soon as we begin to discuss 
and name abilities or traits, and to compare the relative standing of 
different people on such faculties. Factor analysis is complement- 
ary, not opposed, to the approach of the experimental psycho- 
logist; but both are opposed to the layman’s unscientific specula- 
tions about human qualities and their underlying nature. 

It should be realized also that the ‘map’ of the mind so far 
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provided by factor analysis is very incomplete, although it repre- 
sents a remarkable advance over what was known at the beginning 
of the century. Factorial investigations normally require the 
application of at least a dozen tests (Americans prefer forty to fifty) 
to several hundred subjects, and the labour of calculating the 
correlations and extracting the factors is almost too great to be 
done without mechanical aids. Moreover, the results are so 
affected by the particular tests used, especially when the battery is 
small, and by the background, sex, age, and other characteristics of 
the populations tested, that it is only by co-ordinating the findings 
of numerous analyses that reasonable certainty begins to emerge. 
Finally we shall see that different analysts often interpret the same 


results differently, though the confusion to which this leads is more 
apparent than real. 


CHAPTER II 
o 
LANDMARKS IN THE DEVELOPMENT OF FACTOR 
ANALYSIS 
a o 

Abstract. Some of the investigations which contributed most to 
the development of factor analysis from 1904 to 1947 are surveyed. 
Until the 1930s the predominant view among American psycho- 
logists was that all abilities are highly specific. In Britain, the im- 
portance of the general factor, g, was demonstrated by Spearman, 
but the existence of additional sub-types of ability or group factors 
gradually emerged from the work of Burt, Kelley, Stephenson, El 
Koussy, Alexander, and others. Results obtained from analyses 
among recruits during the 1939-45 war confirmed the hierarchical 
theory, to which this book is committed. This holds that there are 
certain main types of ability over and above g (in particular the 
educational and the practical types), and that these themselves can 
be subdivided into numerous minor group factors. Thurstone, 
Guilford, and other factorists in America from 1938 on, opposed 
the notion of a general factor and hierarchy. Instead they showed 
that test inter-correlations could be accounted for by a number of 
independent types of ability or multiple factors, not unlike the 
nineteenth-century faculties. However, more recent work suggests a 
rapprochement between the hierarchical and the Thurstonian 
viewpoints. 

The Viewpoint of Early American Psychologists. In the 
late nineteenth century the method of correlation was’ devised, 
largely by Galton and Pearson, for measuring the agreement 
between two sets of scores. Some of the first applications of this 
method to mental functions were made by Wissler (1901) and 
Thorndike in America, with disconcerting results. Tests of re- 
action time and sensory acuity, for example, showed scarcely any 
correlation with the grades of college students. Apparently the 
‘alertness’, ‘concentration’, ‘Sensitivity’ or other qualities entering 
into these tests were not the same as the ‘alertness’, etc., involved 
in university work. Indeed for many years the notion of measuring 
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mental ability in general was discredited in America. The experi- 
ments of William James, Thorndike and others on transfer of 
training reinforced the view that abilities are highly specific. If 
practice in learning English poetry does not improve the ability to 
learn French poetry, there can be no such thing as general memory. 
As Stratton put it: ‘What you do to the mind by means of educa- 
tion knows its place; it never spreads. You train what you train.’ 
Probably no American psychologist of the present day adheres to 
this extreme specific view, which Spearman called the anarchic 
theory of mental structure. Nevertheless it exerted a profound 
influence right up to the 1930s. Muscio (1922) in Britain, and 
Perrin (1921) in America obtained extremely small correlations 
between different tests of manual skills, and, in a well-known study 
of mechanical ability at Minnesota University, Paterson and 
Elliot (1930) showed that mechanical capacities are far from 
unitary. In the field of personality and character also different 
tests of the same trait often failed to correlate, one of the most 
striking investigations being that of Hartshorne and May (1928). 
These authors concluded that we should try to develop honest 
habits among children in each specific life situation, rather than to 
train honesty in general. 

Spearman’s Two-Factor Theory. During the period 1900-30 
British psychologists, headed by Spearman, Thomson and Burt, 
followed a different course. It was in 1904 that Spearman pub- 
lished his correlations between sensory tests and estimates of 
intelligence which showed that: ‘all branches of intellectual 
activity have in common one fundamental function (or group of 
functions), whereas the remaining or specific elements of the 
activity seem in every case to be wholly different from that in all 
others.’ When correlations can be wholly accounted for by g, they 
tend to fali into what he called a hierarchical pattern.1 Later he 
developed the tetrad difference technique of proving that no signi- 
ficant factors other than g and specifics are present. 

The Abilities of Man (1927) contains the fullest account of 
Spearman’s theories and of the numerous supporting investiga- 


1 This term refers to 
should not be confused with the hierarchical structure of group factors, des- 


ment of factor analysis, particularly i ica, is revi 0) in 
Factor Analysis to Sevan ii lyin America, is reviewed by Wolfle (1940) 
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tions by himself and his students. In it he shows that neither the 
anarchic, nor what he calls the monarchic or oligarchic theories of 
the mind accord with the facts. The monarchic view reduces all 
abilities to a single capacity of general intelligence or ‘common 
sense’. This would imply that they are all perfectly correlated, and 
would make no allowance for the unevenness of people’s abilities 
along different lines. The oligarchic theory is the view that the 
mind is ruled by a number of separate powers or faculties. Spear- 
man’s two-factor theosy satisfactorily explained the tendency for 
all abilities to overlap to some extent, and yet to show considerable 
unevenness. The pupil who is good at English is usually superior 
also in arithmetic, and even at handwriting or in physical health. 
At the same time each subject involves its own specific, or s, factor; 
hence some pupils may be relatively better at English than at 
number work, or vice versa. The specific factors in practical and 
physical abilities are larger, their g-saturations smaller, hence dis- 
crepancies in these abilities are much more common. 

The two-factor theory provides a logical basis, also, for devising 
satisfactory tests of g. We need not, like Binet, choose tests or 
items which appear to involve judgment (or whatever we think 
intelligence consists of). Instead tests are taken which have been 
proved, by correlational analysis, to have high g-loadings. Each of 
these tests will have some specific content, but as these s-factors 
are, by definition, independent, when we combine several tests 
the various s’s will tend to cancel out, leaving us with a purer 
measure of g. 

The Nature of g and of Specific Factors. Although Spear- 
man wisely refused to identify g with intelligence or any other 
quality whose definition was controversial, he suggested that it 
depends on the general mental energy with which each individual 
is endowed. ‘The s-factors he compared to a large*number of 
mechanisms or engines, which could be activated by this energy. 
They are largely affected by education and training, whereas g is 
innate and ineducable. By studying the tests with high or low g- 
saturations he concluded that the outstanding psychological char- 
acteristic of the former is that they involve seeing relationships, or 
to use his own terms, the eduction of relations and correlates. In 
answering, say, an arithmetic problem, the pupil has to grasp the 
relations between the various data presented, and to deduce some- 
thing new in order to reach an answer which bears the correct rela- 
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tions to these data. By contrast, if the pupil is asked merely to 
repeat the multiplication tables which he has learnt by rote, no 
new relationships are involved. Hence the g-saturation of the 
former task is high, of the latter low. 

Thomson’s Criticisms of Spearman. Some of Spearman’s 
statistical techniques were strongly criticized by Thomson, and 
he argued that the two-factor theory was a possible, but not a 
necessary, inference from the statistical results (Brown and 
Thomson, 1921). The tendency towards Positive correlation and 
zero tetrad differences could equally well be explained if abilities 
depend on a very large number of small causes in the mind (cf. 
Theory of Bonds, p. 31). 

Spearman’s N eglect of Group Factors. The chief criticism 
that would be raised nowadays against Spearman’s views is that he 
failed’ to allow sufficiently for types of ability which, while less 
general than g, are certainly not specific. He admitted indeed that 
different number tests, also different mechanical, and certain other 
types of test, show residual correlations over and above g. But he 
ascribed this to the presence of common specific factors and in- 
sisted that such ‘specific overlap’ is very rare. Actually the notion 
of specific overlap is a contradiction in terms, and towards the end 
of his life Spearman did begin to recognize the existence of broad 
group factors such as the verbal and spatial, which arise from the 
overlapping of quite diverse s-factors. One reason why his own 
work, up to 1927, failed to yield evidence of group factors was that 
he and his followers were seldom able to test large populations. 
Hence any residual overlap that did appear was usually not 
Statistically significant; it might have arisen from chance errors in 
the correlations. But Spearman was unduly cautious and did not 
admit that lack of statistical significance does not disprove the 
existence of additional factors; it only fails to prove it. A large- 
scale experiment was carried nut by Brown and Stephenson (1933) 
with the avowed object of demonstrating the truth or falsity of the 
two-factor theory. Three hundred 10-year boys were given 
twenty varied tests. Some of the pairs of tests did in fact show 
correlation beyond that accounted for by g. But the authors 
attributed this to specific overlap, and on eliminating the disturb- 
ing elements they were naturally ablé to prove that g was the sole 
factor Present. Some years later Blakey (1940) re-analysed the 
correlations by Thurstone’s method, without omitting any of the 
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awkward overlap, and concluded that verbal, perceptual and 
spatial group factors were present, though their variance amounted 
only to 12.9 per cent. as compared with 41.2 per cent. attributable 
to g. 

It is noteworthy that if Spearman’s strict view was correct, 
educational or vocational guidance with the aid of tests would be 
impossible. We could not measure aptitude for linguistic or 
mechanical work by linguistic or mechanical tests, since both 
types of test would predict nothing but g. In fact the only tésts 
worth using would be the purest g ones. By means of these we 
could determine the general level of occupation or education for 
which an individual was suited, but could not differentiate between 
different types of ability at this level. The only possibility would 
be to apply tests covering the specific factors in each prospective 
job. Thus an assembly test might measure the s-component of 
mechanical assembly work, but would throw no light on aptitude 
for lathe operating or other mechanical jobs. 

In point of fact Spearman has proved much more nearly right 
than vocational and educational psychologists would wish him to 
be. We shall see later that group factors are generally more limited 
in scope than general, and highly specific, ones, so that it is indeed 
very difficult to differentiate types of aptitude. 

Burt’s Analysis of Scholastic Attainments. As early as 1909 
Burt had obtained suggestive evidence of a sensory discrimination 
group factor beyond g, and in subsequent years he explored the 
fields of imagery, temperament and scholastic attainments. His 
memorandum on The Distribution and Relations of Educational 
Abilities (1917) was a landmark since it provided clear evidence 
(which Spearman continued to ignore) of verbal, numerical and 
practical group factors in school subjects,* in addition toa general 
factor. Also he arrived at the fundamental formula for the Simple 
Summation technique of analysis, later rediscovered by Thurstone 


-and named the Gentroid method, and developed techniques of 
_ assessing group factors. The verbal factor appeared to be two-fold, 


one part including the more complex or literary subjects—Com- 
position, History, Geography and Science, the other including the 


1 Sli i 915-16), Carey inter-correlated the school examination 
mates eie AEA en found a distinctive practical factor in Writing, 
Painting and Needlework. There were indications also of a verbal factor in 
Composition, Reading and Spelling, but Geography, Science, History and 
Arithmetic appeared to depend only on the general factor. 
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simpler word-reading and spelling attainrnents. The practical 
group included Handwork, Drawing, Writing Quality and Speed. 
Substantially similar results were obtained with 613 ten-year 
children in 1939, except that the two types of verbal ability 
appeared to have amalgamated. For the average school subject 
the variance attributable to the general factor was 27°9 per cent., 
and to group factors 20-7 per cent. Another interesting point was 
that the general factor correlated highly, but not perfectly, with 
an intelligence test. This suggested that general scholastic ability 
is largely made up of g, but involves in addition such qualities as 
interest and industry. 

Kelley’s Crossroads in the Mind of Man (1928). In America, 
Kelley studied the inter-correlations of batteries of tests given to 
three groups of over a hundred pupils, aged around 13, 9 and 
34-6 years. By means of an elaborate and rather difficult technique 
which has seldom been used since, he established much the same 
pattern of verbal, number, rote memory, spatial, and speed factors 
at each level. The general factor was still the most prominent in 
all groups, but Kelley accorded it a much less important role than 
Spearman, interpreting it as heterogeneity due to differences in age 
or maturity, race, nurture, sex, etc. 

The Minnesota Study of Mechanical Ability. This investi- 
gation by Paterson and Elliot (1930) represents another assault on 
the two-factor theory. The finding tht the average correlation 
between some twenty-six tests applied to 13-year boys was only 
+ -17 is often cited to show that mechanical and motor abilities are 
highly specific. But actually the low correlations occurred chiefly 
among physical tests such as Dynamometer, Steadiness and 
Agility, and among certain questionnaires or assessments of inter- 
ests, which are likely to be unreliable at this age. Table X (p. 102) 
shows considerabie overlapping among the twelve more important 
measures in the main. experitnent with 100 boys. What the in- 
vestigators did show (cf. Carter, 1928) was that a single general 
factor as defined by Spearman, and substantiated by the tetrad 
difference or other techniques, does not fit their results. But they 
admitted the presence of group factors, though they did not com- 
mit themselves as to their nature. They claimed also that mechani- 
cal ability or abilities are almost independent of g, but then they 
were considering only a single verbal group test which, according 

1 Except by McDonough (1929). 
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to later work, probably involves almost as much v as g. Further 
their group is likely to have been fairly highly selected, and we 
shall see below that g-saturations are always reduced when all the 
testees are high (or all low) in g. Further deductions from these 
figures are given on p. 101f. 

Group Factors Established by Stephenson and El Koussy. 
Two more investigations employing Spearmanian techniques 
deserve special mention. In 1931, Stephenson gave seven verbal 
and eight non-verbal intelligence tests to 1,037 girls, aged around 
ten to twelve. Correlations between the non-verbal tests could be 
accounted for by a single factor, which he identified with g. The 
verbal tests were more complex, but their correlations with one 
another and with the non-verbal tests, could be accounted for by 
g and a verbal group factor. It should be pointed out, however, 
that Stephenson’s results do not disprove the alternative of 
another group factor of a spatial-perceptual nature in the non- 
verbal tests—that is a structure similar to that of Table IV. In 
terms of variances (roughly calculated by the present writer) 
Stephenson’s solution was: 

g k v Communiality 

Average non-verbal test 38% 0% . 0% 38% 

Average verbal test 36% 0% 138% 49% 


A solution which would’be more favoured nowadays, and which 
maintains the same communalities, would be: 
k v 


£ 
Average non-verbal test 31% 7% 0% 
Average verbal test 44% 0% 5% 


The symbol k for the spatial factor was first applied by El 
Koussy (1935) who gave twenty-six tests to 162 boys aged 11 
to 13, He showed by tetrad analysis that eight of these obtained 
loadings on such a factor with about the same variance as their 
g-loadings. According to introspective evidence all these tests 
seemed to require visual imagery for their successful solution. 
Other tests employing visual material, together with Cox’s 
Mechanical Explanations and Completion (i.e. mechanical com- 
prehension) tests, and school marks in woodwork and drawing, 
gave only low correlations with this factor (cf. p. 66). 
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Thurstone’s Multiple Factor Analysis. In 1931 Thurstone 
developed the centroid technique of analysis and applied it to 
measures of attitudes and to ratings of personality traits, where it 
was natural to expect—not a general factor and small subsidiary 
group factors—but a number of components of more nearly equal 
variance. The differences between two-factor, group-factor and 
Thurstonian multiple-factor analyses may be illustrated by the 
diagrams in Fig. 1. According to the third diagram, no factor runs 
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Fig. 1. Two-Factor, Group-Factor and Multiple-Factor Analyses 


through all the tests. Each covers a different, though often over- 
lapping, set ‘of tests. Thus the content of some tests can be as- 
cribed to one factor only, while others show significant loadings on 
two or even three factors. Note that the blank entries in the 
diagram are not usually zero loadings, but are so small that they 
can be attributed to chance. Ina group-factor analysis, however, 
every test has a general factor loading and a loading on one (or 
occasionally more than one) group factor. In each type of analysis, 
every test shows its own specific factor. 

Alexander’s Investigation. Apparently the first application 
of Thurstone’s method to abilities was that of Alexander (1935), 
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who gave large batteries of verbal and non-verbal intelligence 
tests and certain performance tests to groups of about one 
hundred Scottish primary school boys and girls (11-12 years), 
American secondary and technical school pupils (16-17), and adult 
women in a delinquent institution. For the technical school group 
he also had school examination marks. Actually the multiple 
factors that he obtained conformed quite closely to a group factor 
pattern. Thus in addition to g, there was a v factor in the verbal 
tests, and some of the more complex and constructive performante 
tests gave a practical group factor, which he called F. „It was on 
the basis of these results that Alexander developed his performance 
test scale, consisting of Cube Construction, Kohs Blocks and 
Passalong, for measuring ‘concrete’ or practical ability. Another 
important finding in the third group was that the measures of 
school attainment showed a separate group factor of their own, 
thus confirming Burt’s results, mentioned above. He called this 
factor X, and identified it, very plausibly, with the influence of 
personality and interests, i.e. with something in the nature of 
industriousness which affects all school work.1 

Thurstone’s Primary Mental Abilities. In 1938 Thurstone 
published the first of his long series of investigations of human 
abilities, namely an analysis of fifty-six tests given to 240 college 
students. This seemed to reveal a complete break with Spearman, 
since there was no g at all, but—much as in the personality field—a 
series of distinct multiple factors. The eight main or primary 


factors were identified as: 


V Verbal P Perceptual Speed J Inductive Reasoning 
N Number M Rote Memory D Deductive Reasoning 
W Word Fluency S Space or Visualization 


° 

Note that American multiple factors are usually assigned 
capital letters, while British g and group’factors receive small 
letters. Though their content and derivation are very different, 
the status of these primary factors is closely similar to that of 
nineteenth-century faculties, against which Spearman had battled 

1 Numerous previous writers had, of course, interpreted discrepancies be- 
tween I.Q. and E.Q. as due to emotional influences. In effect the X factor is 
much the same as the A.Q.—the accomplishment or achievement quotient. 


Alexander claimed yet another factor, Z, of doubtful reliability, which has since 
been identified by Yela (1949), after re-analysis, with Thurstone’s or Meili’s 


perceptual synthesis factors (cf. pp. 58, 89). 
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for over thirty years. Spearman (1939) was quick to point out that, 
as all Thurstone’s tests were positively inter-correlated, they could 
equally well be analysed to yield a large general factor and smaller 
group factors. Such alternative analyses of Thurstone’s figures 
wete, in fact, carried out by Holzinger and Harman (1938) and 
Eysenck (1939). The latter’s g obtained a variance of 30-8 per 
cent., and the combined group factors 23-5 per cent. The content 
of the group factors corresponded quite closely to that of 
Tnurstone’s primary factors, just as in Fig. 1/3 the multiple 
factors A_ B and C cover much the same tests as the group 
factors of Fig. 1/2. Although Thurstone’s solution of the 
factors underlying the tests is as legitimate mathematically as a 
general + group factor solution, he has not disproved the 
existence of a g. In effect he has divided it up among his seven 
group factors. The arguments for and against these alternatives 
(which are much less irreconcilable than might appear at first 
sight) are examined in the Appendix. 

Subsequent American Work. Almost all factorial psycho- 
logists in America! have followed Thurstone’s lead, and their 
results, like his, can readily be fitted into the picture of mental 
structure advocated in this book. With their vastly greater re- 
sources for applying huge batteries of tests to large groups, and for 
doing most of the donkey-work of calculation by machine, it is 
natural that they should be responsible for most of the advances of 
recent years in this field. 

Thurstone, aided by his wife and students, have greatly extended 
the above investigation. Several of the primary factors have been 
studied in more detail, by analysing the original tests for a factor 
along with others which helped to define it more accurately, or 
showed how it could be sub-divided. And comprehensive batter- 
ies, similar to the original one, have been given to high-school and 
younger pupils, even to 5~5 year olds. The results at differ- 
ent ages have been remarkably concordant. Thus the first six 
factors in the above list were clearly identified when sixty tests 
were analysed among 710 pupils aged around 14 years (‘Thurstone, 
L. L. and T. G., 1941), though Deduction disappeared and In- 
duction seemed to be better named R (Reasoning). P was also 
somewhat unstable, and has been omitted from the battery of 


1 With the exception of Holzinger, and R. B. Cattell, the latter a pupil of 
Spearman. 
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Primary Mental Abilities tests issued for measuring these factors. 
One noteworthy difference, however, is that the primary factors 
among children tend to be less independent. Since they correlate 
with one another moderately highly, they themselves can be 
analysed in the same way as tests are analysed, and they usually 
reveal a kind of super-factor, or what Thurstone calls a second- 
order general factor. Though he does not go so far as to identify 
this with g, he admits that it constitutes a bridge between his own 
and Spearman’s viewpoints. He now describes primary factors a8 
‘facilities’ in the mind, or ‘media of expression’, and regards 
second-order factors (of which g may be one) as more central 
(Thurstone, L. L., 1948). This theory is strongly reminiscent of 
Spearman’s general energy and specific engines. 

The U.S.E.S. Investigations. In 1945 there appeared the first 
report on large-scale researches by the United States Employment 
Service, Division of Occupational Analysis, into the development 
of a set of differential aptitude tests (cf. Staff, Division of Occupa- 
tional Analysis, 1945). Various batteries of about twenty tests 
(fifty-nine in all) were given to nine fairly large and representative 
groups of adult applicants for employment, totalling 2,156 per- 
sons. On analysis, the most stable or consistent factors, which 
recurred in most of the groups were: 

V Verbal P Perceptual T Motor Speed 
N Number Q Clerical F Finger Dexterity i, 
S Space ZL Logic M Manual Dexterity | 
A Aiming 
also a general factor. 

The best tests for measuring each factor have been issued as a> ~~ 
battery, lasting about three hours, and from a testee’s profile or 
pattern of factor scores, it is hoped to predict the type of accupation 
for which his aptitudes fit him (Dvorak, 1947). 

Factorial Studies in the U.S.A.A.F. During the 1939-45 war, 
large-scale testing of recruits occurred in Britain and America, and 
factor analyses were often carried out on populations of a thousand 
or more. Particularly extensive use of the technique was made by 
Guilford and his collaborators in the U.S. Army Air Force (Guil- 
ford 1948ab, Guilford and Lacey 1947, Davis 1947, Melton 1947). 
Studies of the job of the pilot and of other air-crew personnel 


1 The mathematics of oblique (i.e. correlated) factors are clearly described in 


the later editions of Thomson’s textbook (first published 1939). 
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suggested what abilities might be worth testing. Elaborate and ex- 
tremely ingenious tests for measuring each of these were devised, 
and factorization was applied to discover which were consistent 
and distinctive. By this means pilot aptitude itself was largely 
broken down into factors objectively definable by appropriate test 
batteries, instead of into subjectively determined qualities. More 
then twenty factors are claimed : 


~- Carefulness Mechanical Infor- Psychomotor Speed 
a mation 
Integration I, II Perceptual Speed Reasoning I, II and 
and III III 


Pilot Interest Spatial Relations I, 
Length Estimation II and III 
Planning Ability 
Memory I, Iland Social Science Inter- 
II Psychomotor Co-` est and Training 
ordination 
Mathematical In- Psychomotor Pre- Verbal 
terest and cision 
Training Visualization 


Factorial Studies in the British Services: Hierarchical 
Group Factor Theory. In this country, where most work was 
done on less selected samples of the population such as Navy and 
Army conscript recruits, the importance of g was amply confirmed 
(Vernon, 1947a). In eight analyses, g was found to cover more than 

twice as much variance as all group factors combined. Table V 
shows an analysis of thirteen tests given to 1,000 Army recruits, 


g 


Major group factors 
vied kim 


Minor group factors ETI aes 
ANAL 


iagram illustrating Hierarchical Structure of Human Abilities 


Specifi c factors 
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and brings out a feature which appears to be highly characteristic 
of mental structure, namely hierarchy. After the removal of g, 
tests tend to fall into two main groups: the verbal-numerical- 
educational on the one-hand (referred to as v-ed factor), and the 
practical-mechanical-spatial-physical on the other hand (referred 
to as k:m factor). If the analysis is sufficiently detailed, i.e. if 
sufficient tests are included, these types themselves sub-divide. 
The v-ed factor in Table V gives minor v and n (number) group 
factors. In other analyses (e.g. Table IX), k:m splits similarly int 
mechanical information, spatial, and manual subfactors.. Thûs a 
first approximation to mental structure is provided by the hier- 
archical diagram of Fig. 2, resembling a genealogical tree. Its 
advantages and limitations form the subject of the next chapter. 


TABLE V. SIMPLE SUMMATION AND GROUP FACTOR ANALYSES 
OF TESTS GIVEN TO 1,000 ARMY RECRUITS 


Tests Unrotated Centroid Factors Group Factors | 
I II II IV|h* |g km ed. v n/[ h 


0 Progressive 
Matrices *77 +°23 +10 —'16| -68 | +79 +17 
Dominoes 
(non-verbal) | *80 +:09 +-°19 —:12| +70 | +87 
Group Test 
70, Pt. I +74 +:16 +03 —'08| -58 | +78 > 


4 Squares +63 +:35 —:00 +'01| - +59 + 


o 

8 Assembly +37 +:54 —'15 +:28 
2 Bennett 
Mechanical | -69 +°33 —*17 +:07 


25 Verbal +88 —-24 —-26 —-14] 
Dictation +79 —-42 —*25 —-11 
14 Spelin 81 —-32 —:20 —:11 
elling +81 ——-32 m20 —: 

21 Tagenticnona +89 —-06 +11 —-15 
3a Arithmetic, 
Pt. I +84 —-29 +22 +:23 
Arithmetic, 
Pt. II +86 —-16 +°12 +-13 
23 ACTS. 
Arithmetic | +84 —'21 +'26 +:14 


Variance 
per cent. 59°8 8:5 3:1 2:2 


52.5 8:7 8:4 6:9 


The Relations Between Group-Factor and Multiple-Factor 
Analyses. Table V also serves to illustrate some of the resem- 
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blances and differences of centroid and group-factor analyses. Ina 
centroid analysis the first factor represents the highest common 
element in all the tests. It is not usually the same as g, but is a kind 
of average of the particular tests applied in the investigation. 
Subsequent factors, II, III and IV, are known as bipolar, since 
roughly half the tests receive positive, half negative, signs. ‘These 
successively divide the tests into contrasted groups, and although 
they may have no psychological meaning as they stand, yet they do 

‘usually reveal what group factors are present. Often this classifica- 
tion by bipolar factors tells us all we need to know, and in several 
examples in subsequent chapters the original, or unrotated, 
centroid factors alone are quoted. But it is preferable to transform 
the first and the bipolar factors into a series of factors where all the 
tests have either positive loadings, or zero or insignificant negative 
loadings, by means of what is called rotation of axes. This of 
course redistributes much of the variance of the first factor among 
the remaining ones. Actually the aim of rotation is to maximize the 
number of zero or insignificant loadings on each factor, so that as 
much as possible of the variance of each test is confined to a single 
factor. Thurstone calls this Simple Structure. Often such rotation 
does yield a general factor running through most or all of the tests, 
and smaller factors each confined to a few tests, in other words a 
group-factor pattern. But true group-factor analysis is carried out 
by assessing g-loadings first, and then analysing the residual cor- 
relations in each group of tests, as in Tables I-IV. A clear and 
much fuller account of these different types of analysis, and their 
inter-convertibility, is given by Burt (1944; cf. also 1938, 1939a, 
1940a, 1949). 

Other Methods of Analysis. This historical résumé must not 
omit to mention, however briefly, certain other approaches to 
factor aualysis which have been less widely applied than general 
or group factor, and simple summation or centroid, methods. 
Broadly speaking they are more accurate mathematically, but do 
not provide appreciably more psychological information about the 
make-up of the analysed tests, which would compensate for their 
much greater complexity and tediousness of calculation. They 
include Burt’s Weighted Summation, Lawley’s Maximum Likeli- 
hood, Hotelling’s Principal Components, and Kelley’s Principal 


Axes, methods. Explanations may be found in Thomson’s and 
Burt’s textbooks. 


CHAPTER III z 
HIERARCHICAL GROUP-FACTOR THEORY OF THE 
STRUCTURE OF ABILITIES 
Dd 

Abstract. The strict hierarchical picture of mental structure is 
an over-simplification. For the results of any factor analysis 
depend largely on the composition of the population tested (¢.g. 
its degree of selection), and on the number and kind of tests 
studied. Since by choosing suitable tests almost any specific factor 
can be turned into a group factor, it is suggested that only those 
group factors shown to have significant practical value in daily life 
are worth incorporating in the picture. It is doubtful whether 
group factors differentiate merely as a result of ageing or mental 
growth. Rather, their pattern or structure changes according to 
the type of education and training. Thomson’s theory of bonds 
gives a useful explanation of g and of ability group factors, but the 
influence of temperament and personality, physique, sex, age, 
interests, etc. should be taken into account. The fallacies of the 
layman’s conception of theofetical vs. practical, and other opposed 
types of abilities are discussed. 

The Hierarchical Diagram Should be Regarded Only as an 
Approximation. The hierarchical theory which was outlined at 
the end of the preceding chapter, was first put forward by Burt, 
under the influence of McDougall. In a recent article (1949) Burt 
describes how it originated, and shows that it applies in the fields 
of temperament and of anthropometric measurements, as well as 
to abilities. Though it is certainly ar improvement both on the 
original two-factor theory and on the ‘neo-faculty’ theory of 
American writers, it has numerous limitations and implications 
which we must now discuss. The more technical arguments for 
preferring it to the American viewpoint are given in the Appendix. 

A diagram such as Fig. 2 would be obtained only if an extensive 
battery of tests, covering—or at least sampling—most of the varie- 
ties of human abilities, could be applied to a very large and repre- 
sentative sample of the population. With one or two hundred 


o 
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testees the correlations are usually too unreliable for more than 
two to four group factors to be established at a time. In general a 
. minimum of three tests is needed to define a factor, hence only a 
few factors can be resolved in any one investigation with a limited 
battery of tests. Further, if such a battery consists only, or pre- 
dominantly, of a specialized type of test (e.g. all tests of sensory- 
motor abilities), the g and major group factors may fail to reveal 
themselves. The diagram is, in other words, a hypothetical inte- 
gration of all the factorial investigations that have been carried 
out, rather than an established fact. It is considerably expanded, 
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tests and in the learning of spellings, multiplication tables and 
poetry, would the notion of a rote memory group factor be accept- 
able. The same stricture holds for most of the manual dexterity, 
sensory-motor, and co-ordination factors that have so far been pro- 
posed. At the present moment the writer cannot think of any ob- 
jective basis for distinguishing between acceptable group factors, 
and narrow factors confined to the highly specialized types of test 
which psychologists delight in constructing. But he would suggest 
that factors which fail to contribute at least 5 per cent. to the 
variance of some measure of educational or occupational ‘pro- 
ficiency or other capacity in daily life should be relegated to the 
latter category. If, for example, g and v tests alone predict, the 
ability to learn and retain poetry to the extent of a correlation of 
-60 (i.e. a variance of 36 per cent.), then the addition of tests of 
rote memory to the predictive battery should raise the correlation 
to at least -64 (variance 41 per cent.) if the factor measured by 
these tests is to be acceptable. Such a criterion involves sub- 
jective judgment as to what constitutes a “capacity in daily life’, 
and is beset with many difficulties. But it appears preferable to a 
judgment of the broadness vs. narrowness of the tests which yield a 
distinctive factor. 

Relative Importance of Factors at Different Levels. The 
hierarchical group-factor viewpoint implies that most of the vari- 
ance of human abilities ir daily life is attributable to g and to 
highly specific (or very small group) factors, and that the role of 
the broader group factors is rather meagre. If our diagram could 
be worked out completely to cover all human abilities, the g- 
variance might amount to about 40 per cent., the major and minor 


1 This suggestion recalls Thomson’s (1939) argument that factor analysis is 
of little use in vocational or educational psychology, because the predictive value 
of tests can be established much more efficiently by multiple corrélation tech- 
nique. The writer would agree that the content of a test as determined by factor 
analysis at the present time often fails to revedl its truepredictive value, because 
the test’s specificity may embody other group factors which are particularly 
relevant or irrelevant to some job. For example a test of graph-reading is very 
useful in selecting radar operators, but when analysed it usually appears to con- 
sist purely of g + n + specificity. — More detailed analysis would how- 
ever break down part of the specificity into a minor group factor for graph- 
reading, i.e. a sub-division of #. This is the line that Guilford and Lacey fol- 
lowed in the U.S.A.A.F. After considerable experience of multiple correlation, 


the writer has come to the conclusion (Vernon and Parry, 1949) that it is much 
too efficient. It does not, like the factorial approach suggested in this note, 
sufficiently allow for the chance errors in the validity coefficients of selection 
tests, and prohibits the inclusion of two or more rather similar tests in order to 


improve the reliability of prediction. 
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group factors to some 10 per cent. each, and the remaining 
40 per cent. would consist of very narrow group factors and un- 
reliability. This means that, fair ly good predictions of ability in 
education, industry, or everyday life, can be achieved by g tests 
alone, and that somewhat more ground can be covered by tests of 
the main group factors. But only by much more detailed experi- 
mentation on tests relevant to particular jobs, or by work-sample 
methods (i.e. trying candidates out on the actual work), can much 
more than 50 per cent. accuracy be obtained. This explains why 
Stanford-Binet or Terman-Merrill 1.Q., or all-round intelligence 
as measured by reliable group tests, have considerable practical 
value both among children and adults, whereas more specialized 


tests add something but not very much in educational and voca- 
tional guidance. 
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makes it extremely difficult to fill out the details of our diagram 
accurately. We cannot expect to reach a final and complete map 
of the structure of abilities, since it necessarily varies with the kind 
of population studied. y 

Differentiation of Specialized Abilities with Age. The rela- 
tive prominence of g has often been thought to depend considerdbly 
on age. The writer at one time accepted the view that g tends to 
differentiate into more specialized abilities during adolescence and 
early adulthood. This, view is advocated by Garrett (1946), who 
summarizes several confirmatory investigations. But most ofthese 
compare college with high school, or high school with elementary 
school, populations, hence the smaller g-variance in the older groups 
may be due merely to their greater selectivity. ? 

Clark (1944) did choose groups of eleven-, thirteen- and fifteen- 
year pupils with the same distribution of group test I.Q.s} and 
found a decline in the average inter-correlation of the Primary 
Mental Abilities tests from -488 to -393. Other studies such as 
those of Swineford (1947), Reichard (1944) and Doppelt (1949) 
fail to support the theory. Anastasi (1936) summarizes a large 
number of investigations and shows that, though there are strong 
indications of alterations in factor patterns with age and training, 
the evidence for differentiation is far from unanimous. McNemar 
(1942b) carried out fourteen factorizations of Terman-Merrill scale 
items at mental age levels ranging from 2 years to 18 years. 
His results are irregular and show no sign of any consistent trend 
towards greater differentiation at later ages. This might be 
criticized on the grounds that the later items are less diverse in 
content than those for young children. However Balinsky (1941) 
factorized an identical battery of tests, namely the Wechsler- 
Bellevue scale, among groups aged 9, 12, 15, 25-9, 35-44 
and 50-59, all with average I.Q. 100. He obtained the following 
first factor variances: 38, 36,24, 20, 33 and 45. These suggest 
differentiation from 9 to 30 and then greater integration. But 
he neglected to ensure the same degree of heterogeneity among 
the testees at all ages, and when correction is made for this, his 
first factor variances show much the same irregularity and lack of 


any clear trend as McNemar’s.+ 
1 It is difficult to see how a crucial investigation of this problem could be 


planned even if strictly random samples could be tested at several age levels. 
For as Emmett (1949) points out, the content of the tests should be equally 
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Particularly striking are Williams’s (1948) results from the ap- 
plication of the same battery of ten intelligence, spatial and me- 
chanical tests to samples of 250 boys, carefully chosen to be 
representative, at the ages of 12, 13 and 14. Here the first factor 
variances were 51, 56 and 62 per cent., respectively, indicating that 
secondary education tends to produce greater integration, not 
specialization, of verbal and practical abilities. In a research by 
the writer the standard British naval battery of five tests was given 
td 1,171 boys leaving school at 14, and the results were compared 
with"those of 265 seamen recruits who had also left at 14 in the 
same district some four years Previously. Scores tended to rise 
with age on the spatial and mechanical tests, and to drop on the 
arithmetical ones (cf. Vernon and Parry, 1949), but the average 
inter-correlation and §-saturations were almost identical. T'he only 
significant change was 


arithmetic with mathematics (from - 642 to "379). The correlation 
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practised at school or in jobs they tend to become more specialized, 
though sometimes the teaching is of such a nature as to increase 
integration. Again regression or de-differentiation may often 
occur as the effects of past training wear off. It is conceivable that 
secondary schooling is more fragmentary in America than in 
Britain, and so apt to produce more differentiation between 12 
and 18 than is usual here. But undoubtedly the main reason for 
the apparent reduction in the importance of g in adults is that the 
testees are more homegeneous in ability. S 
It is because the majority of American investigations arë con- 
ducted with college students, aircraft pilots, high-school pupils and 
other selected groups, that their results so readily fall into inde- 
pendent primary factors, instead of g and group factors. But when 
more heterogeneous adult groups have been studied, a g has 
usually appeared. Thus Anastasi (1948) quotes American Army 
studies which showed almost as high correlations between verbal, 
numerical, spatial and mechanical tests as between different 
numerical, or different mechanical tests. An analysis of the U.S. 
Navy battery gave a g with variance of over 30 per cent., together 
with smaller mechanical, spatial and educational group factors 
(Staff, Test and Research Section, 1945). The emergence of a g 
in the Division of Occupational Analysis’s investigations has 
already been mentioned. ] 
Psychological Nature of Factors. We must next consider the 
nature of g and the group factors a little more closely. ‘Thomson 
has shown that the statistical fact that test inter-correlations can 
be largely accounted for by a single factor does not prove that such 
a factor represents any unitary power, or organ of the mind. It 
might also arise if the mind is thought to consist of an immense 
number of ‘bonds’, including inherited reflexes, acquired habits 
and associations, etc. A person’s performance at añy one test 


would involve the activation of a large number of such bonds, and 
f tests is given, the extensive sampling of 


if a miscellaneous set 0: 
y tend 


bonds would result in the positive correlations that actuall 
But he agrees that factors are useful concepts for 


to occur. 
inds of samples that may be 


describing the content of the various kinds i 
taken, provided that they are not reified into organs or faculties. 
In this book we accept Thomson’s view, and hold that factors over 
and above g arise, partly perhaps from hereditary influences, but 


mainly because an individual’s upbringing and education imposes a 
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certain grouping on his bonds. The v-ed factor is, as we shall see, 
a rather strongly unified group because our society gives a fairly 
uniform education to all its members. It does not readily break 
down into separate verbal, number, speed, reasoning, attention, 
memory or other factors because the abilities covered by these 
names tend to be developed differently in different schools and 
homes, though partially distinct minor group factors can often be 
established, especially in fairly homogeneous groups such as 
university arts students. On the practical ur k:m side there is, as 
Anastdsi points out, less cultural standardization, hence the k:m 
pole is more heterogeneous and amorphous than v:ed. It would 
appear to be not so much a positive practical ability as an aggregate 
of all non-symbolic capacities, or of bonds that are not usually 
affected by primary schooling. Nevertheless, evidence is given 
below’ that not only mechanical and spatial, but physical and 
manual, and some non-verbal &, perceptual and performance tests 
all have something in common over and above g. The kind of test 


which is most strongly saturated with this factor is the mechanical 
assembly test, presumably be 
non-scholastic activities. 
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personality, physique and other factors which have complex inter- 
actions with ability factors. Physique and physical health con- 
stitute an important dimension (or set of dimensions) which cer- 
tainly affects practical abilities; and physical defects of the senses 
in particular react on educational attainments. Sex influences not 
only v:ed-k:m, but also most of the lower-order group factors, guch 
as manual, imagery, etc. (Burt and Moore, 1912). Age is important 
in spite of the conclusions reached above. Thus among adults the 
spatial, manual and pkysical aspects of k:m tend to decline, whereas 
specialized mechanical skills and information probably go“on in- 
creasing almost to senility (cf. also deterioration, p. 57). Cattell 
(1946) has pointed out that g is somehow associated with such per- 
sonality traits as conscientiousness and with cultured interests. 
Terman’s work on gifted children, and studies of mental defec- 
tives, confirm this. Apparently therefore the bonds establisited by 
character training, and by the development of sentiments and 
attitudes are linked with the bonds responsible for our cognitive 
or intellectual activities. Doubtless interests greatly affect our 
more specialized abilities. It is known also that the fluency factor, 
measured by tests of richness of association with words or pictures, 
is connected with extraverted or cycloid trends, and that such 
physical or manual capacities as visual acuity, dark adaptation, 
agility, and finger dexterities are impaired among neurotics (Slater 
and Slater, 1944; Eysenek, 1947). Again Eysenck finds speed vs. 
accuracy in mental and manual operations to differ among hysteric 
and dysthymic (anxiety or obsessional) neurotics. 

Clearly then we are very far from a complete theory of the 
structure and nature of human abilities, and though it is useful to 
analyse them in isolation as though they were purely cognitive or 
motor, we should not forget that they are abstractions from the 
total personality structure. 9 

Conclusions Regarding g. Finally it may be seen from 
Thomson’s theory that g is not a fixed, purely inherited, quantity. 
Thomson interprets it as the total number of bonds. Presumably 
this is largely dependent on some psycho-physiological and innate 
property of the higher nervous system, but there is no reason why 
the number should not be affected by the use made of the mind, 
and by organic conditions such as brain injury and ageing. This 
fits in with modern research on the highly individual nature of 
mental growth (cf, Dearborn and Rothney, 1941; Fleming, 1948), 
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on the effects of schooling and the intellectual stimulus provided 
by people’s jobs (cf. Vernon and Parry, 1949), and on deterioration 
of mental efficiency in pathological conditions. 

Compensation Theories. We have considered the relation of 
the hierarchical theory to Spearman’s and Thurstone’s views. 
How does it stand vis-ù-vis the popular notion of compensation, 
and of opposed types of people? Actually it admits a large measure 
of truth in the contrast between the theoretical or academic and the 
practical, since they roughly describe our two major group factors. 
But these abilities are not invers 
they are independent only when the influence of gisignored. Thus 
in fact the majority of children who are superior educationally are 
also above average in mechanic 
hands, and even in physique, 
g- The Norwood Report’s se 
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Secondly, the groups with which we are most familiar above the 
age of 11+ are selected ones, and as already pointed out this 
reduces the g-loadings and exaggerates the group factors. Hence 
it is quite conceivable that in a grammar school there may be an 
inverse, or at least a negligible, correlation between, say, mathe- 
matical attainment and football. But if we could study the whdle 
range of 15-year pupils we should find that grammar school pupils 
are usually superior to secondary modern pupils not only in 
mathematics, but also at football. ` 

Thirdly, we are concerned here only with abilities, not with 
interests. The latter probably show much stronger contrasts than 
the former. Thus the adolescent with keen interests in reading or 
other v-ed activities is frequently (perhaps more often than`not) 
weakly interested in mechanical or athletic activities, and it may 
be that he devotes so little time to them that his potentially superior 
ability at such activities deteriorates. Nevertheless, the university 
professor with his high g can usually, if put to it, do better at things 
in which he is not much interested such as cooking a dinner and 
washing up without breaking the crockery, than can alow-g domes- 
tic servant. And it is by no means fanciful to suggest that the 
victories of the Jews in Palestine in 1948 over the Arabs (who tend 
to be more bellicose in interests) was largely due to their superior 
g and v-ed. 

‘Slow but sure’ is another popular compensation theory, which 
likewise ignores the influence of g and other factors that tend to 
make the quick worker more rather than less accurate. However, 
it is considered in more detail in Chapter VII, and is shown there 
to possess a modicum of truth. 

The notion of types of people, as distinct from types of ability, 
should also be discouraged. As Burt (1943) points out, there is no 
more justification for talking of an academic or practival type of 
child than for a tall or a short type. Just as the majority are inter- 
mediate in height, so there are many more who are about equally 
able in educational and practical activities than there are extreme 
cases, Ability types themselves are abstractions, since many abili- 
ties when factorized will be found to be loaded on two or more 
group factors, i.e. to be intermediate. But the grouping is more 
clear-cut than in the case of individuals, because it is often imposed 
by school syllabuses and other cultural institutions or norms. 

Note that our insistence on g does not involve any denial of 
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special talents in individual cases. Apart from such rarities as 
idiots savants, there certainly exist children and adults of mediocre 
g and educational attainment who develop outstanding talents in 
the fields of art or scientific invention, or become leaders in busi- 
ness, politics, warfare, etc. Such talents can to some extent be 
attributed to the possession of strong group factors, but per- 
sonality influences, drives and interests are probably still more 
important. The analysis and measurement of such influences by 
psychologists is far less advanced than that of abilities. Thus the 
warning given at the end of Chapter I against regarding factors as 


covering the whole psychology of human achievements should be 
reiterated. 


CHAPTER IV 
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ANALYSES OF EDUCATIONAL ATTAINMENTS 


Abstract. School marks yield a different structure from objective 
psychological tests because of the X-factor—a complex of person- 
ality traits, interests and background. This, together with g and 
v-ed form the major influence in all educational attainments in un- 
selected groups of children and adults, though differentiation 
according to subject-matter can readily be established in selected 
secondary school pupils or university students. The more drilled 
and mechanical aspects of v (verbal) and 7 (number) abilities 
differentiate most clearly, but there is insufficient evidence to 
justify contrasting ‘rote’ with ‘reasoning’ attainments. Many a 
priori classifications of types of reading and number ability lack 
empirical substantiation. For example word-knowledge (voca- 
bulary) and comprehension in reading come to much the same 
thing. However, mechanical, rate, vocabulary and comprehension 
aspects are partially distinguishable at advanced levels. 

The Industriousness Factor in School Marks. The psycho- 
logist’s v, n and other factors are usually based on tests which are 
fairly pure measures of the abilities at which they are aimed. A 
good vocabulary test, for example, should measure g, v, a small 
ry little else. Educational attainments, 
especially when measured by school or other examinations are 
naturally more complex, and we have already seen that a some- 
what ill-defined factor of industriousness + interest, which 
Alexander calls X, plays a prominent? part. Similar factors, vari- 
ously called interest, study, or ‘halo’ have been reported in 
American investigations by Holzinger and Swineford (1939), Sisk 
(1940), Carroll (1943) and Comrey (1949). Because of this, selec- 
tion for secondary or other higher education by means of g, v, nor 
other psychological tests alone is usually less successful than 
selection which also takes account of previous school work (cf. 
McClelland, 1942). At one time some psychologists did propose 
that children most likely to benefit from advanced schooling would 
D 


error component and ve 
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be those with the highest innate intelligence, rather than with the 
best attainments, but we realize now that this was short-sighted. 

Although interesting attempts have been made to measure or 
assess personality factors relevant to scholastic success, it is 
doubtful whether any are practically applicable on a large scale. 
When teachers’ judgments are studied, some are found to give 
excellent predictions, but others are less competent than objective 
ability tests. The overall result (if we may accept McClelland’s 
findings) is that such judgments add nothing worthwhile to 
estimates based on tests plus school marks, since anything of value 
which the average teacher knows about the industry, etc., of his or 
her pupils is already embodied in the marks, At the moment, 
therefore, we know little about X. , though further research would 
certainly be profitable. In particular we would like to know how 
far it depends on: 

(a) home background; (b) the ‘tone’ of the pupil’s school; 
(c) the stimulatingness of, or good teaching by, his teacher; 
(d) the pupil’s interests; (e) his temperamental characteristics, 

Overlap of School Examinations and Psychological Tests. 
That psychological tests and school marks often measure rather 
different things is shown not only by the imperfections of objective 
tests in picking good and poor scholars, and by Alexander’s work, 
but by such investigations as the following. Bradford (1946) 
presents data for 105 technical school boys on five varied subjects 
and nine paper-and-pencil or performance tests. There is a 
general factor with 24 per cent. variance and a bipolar with 
16 per cent. Separating all the school marks from all the tests. 
Another bipolar, distinguishing the more technical subjects and 
performance tests from the more linguistic subjects and tests has a 
variance of only 4 per cent. Drew’s results (cf. p. 111) are similar. 
Blackwell (1940) compared mathematical achievements of 100 
secondary school boys and«100 girls with scores on spatial and 
verbal tests Specially designed to measure the reasoning processes 
believed to be involved in mathematics. By rotation of axes she 


arrived at factors in which all types of measure are represented, 
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pupils, five school subjects and four tests yielded no such factor 
as Alexander’s or Bradford’s. Instead a clerical test fell in a cluster 
with English and Language marks, and qn intelligence test in the 
Maths-Science cluster. Art marks and spatial-mechanical tests pro- 
vided additional group factors of their own. Comrey (1949) also re- 
ports complex overlapping between test factors and courses of study 
among West Point cadets. It should be noted that only when re- 
sults such as these are obtained is differential diagnosis, i.e. the 
prediction of suitability for different types of school course, possible? 
Unitariness ofv-ed. We would expect attainment measures to 
correlate highly with one another because of their common g and 
X content. But there seems to be a common v-ed ability in addi- 
tion, since the isolation of distinctive sub-factors for different sub- 
jects is remarkably difficult among representative groups of adults 
and children. For example in the investigation of Army recrdits 
in Table V, the correlation between spelling and dictation tests 
was no higher than the correlation of either with the verbal ability 
Test 25. (Descriptions of this and other tests used in the Services 
may be found in Vernon, 1947b, or Vernon and Parry, 1949.) 
Apparently spelling ability is little if at all differentiated among 
average adults from the fluency + vocabulary ability on which this 
test probably depends, although according to Thurstone (1948) 
spelling is a highly distinctive factor among college adults. 
Though verbal and numerical abilities are usually separable, as in 
Burt’s original research and in Table V, they tend to show a good 
deal of overlap, over and above g. Thus in Schiller’s (1934) in- 
vestigation of twelve tests among 395 pupils aged around nine 
years, the correlation between Arithmetic Reasoning and Com- 
putation was no higher than the correlations of both tests with four 
reading tests and tests of verbal g. ee 
Sub-divisions of v:ed. V:ed breaks down more readily into 
specialized abilities in populations thapare mare homogeneous in 
educational level, such as technical schoolboys:or college students. 
Kerr’s (1942) research, already mentioned, gave a factor differ- 
entiating linguistic from mathematical-scientific subjects, though 
its variance was far smaller than that of the general factor. Again 
Wilson (1933) analysed three sets of School Certificate marks and 
found, in addition to a general factor (presumably a mixture of 
g + X + v:ed), group factors for Arithmetic-Algebra-Geometry, 
for French-English and History-English, for Art-Needlework and 
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Art-Handicraft. The mean general and group factor loadings both 
approximated to -53, i.e. about 28 per cent. variance. 

It might be supposed that group factors tend to become more 
prominent relative to general educational ability with older pupils. 
Yet Wolf (1939), when trying to develop aptitude tests for different 
types of courses at Yale University, found almost as high correla- 
tions (averaging -45) between first-year examinations in arts and 
science subjects as between different arts or different science sub- 

“jects (average +59). Among postgraduate student teachers Vernon 
(1939) still found the general educational component predominant. 
His correlations can be most simply analysed into av-ed factor with 
26 per cent. variance, and separate group factors for science sub- 
jects—Psychology, Arithmetic, Hygiene, Nature Study—and 
practical subjects—Speech Training, Teaching Skill, Physical 
Training—with combined variance 12 per cent. The remaining 
subjects—Education, Geography, English and History, depended 
entirely on the general factor. Among Army engineering cadets, 
nine tests of attainment in different branches of mathematics and 
physics and two intelligence tests were analysed. The marks were 
found to depend only to 5-3 per cent. on g, but to 49-3 per cent. 
on a mathematics-physics educational factor, Additional group 
factors, covering 18-5 per cent., involved: 


(1) Lower maths—Arithmetic and Algebra. 
(2) Higher maths—Trigonometry, Calculus, Co-ordinate Geo- 


metry. 
(3) Physics—Mechanics, Heat, Light, Electricity. 


The unitariness of v-ed is illustrated too by the success with 
which verbal or mathematical tests predicted ability among Ser- 
vice recruits at almost all jobs involving theory or bookwork, ex- 
cept the most highly specialized such as radio mechanics (cf. Vernon 
and Parry, 1949). Similarty recruits who had held clerical jobs as 
civilians were usually excellent at verbal or mathematical jobs such 
as telegraphist, electrical mechanic, etc. No doubt there is a limit 


r We have seen that a 
ts into linguistic and mathematical- 
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more advanced educational levels. Another type of classification 
has been suggested, namely into attainments involving rote know- 
ledge, including spelling and mechanical arithmetic, and attain- 
ments involving more reasoning such as reading comprehension, 
composition, and mathematics. Burt first made this distinction 
among linguistic subjects, but failed to confirm it in his later in- 
vestigation of a larger group of ten-year-olds (p. 15). Sutherland 
(1941), working with 134 eleven-year-olds, found, in addition to 
8, v and n, a small groupsfactor in spelling, mechanical arithmeti® 
and a number series test, which he tentatively labelled a memory 
factor; also an ‘induction’ factor in problem arithmetic and number 
series. Conceivably this represents a distinction between „the 
stages of schooling, the ‘rote’ subjects being those that are stu ied 
earliest in the school career. ; Possibly also there is a link with 
Thurstone’s W (word fluency) and V (verbal reasoning) factôrs. 
But it seems equally likely that the distinction arises merely from 
the higher g-content of the ‘reasoning’ subjects. At least there is 
No conclusive evidence so far against this explanation. 

It follows that v and’n are most readily isolated by rather ele- 
mentary tests. Thus Thurstone (1938a), Coombs (1941), Guilford 
and Lacey (1947) and others regard computation ability as most 
representative of their N factor. Similarly Vernon (1949b) founda 
mechanical reading and a spelling test to be more v-, less g- 
Saturated than two silent reading comprehension tests among 
15-year-olds. In the Services rote arithmetic (the first part of 
Naval or Army Test 3) and spelling or dictation were always much 
More strongly opposed to k:m tests than was mathematics (the 
Second part of Test 3). In any moderately selected group the 
Correlations were apt to sink to zero or negative values. It follows 
also that, in primary schools, children are much more likely to 
show unevenness or specific backwardness in elementary arith- 
Metical or verbal attainments than they are in, problem arithmetic 
and composition, or other higher, more g-saturated, subjects. 

Arithmetical-Mathematical Ability. Let us turn now to 
Possible divisions within the mathematics or the English fields. 
Most factorial studies tend to show much less differentiation than 
educationists are apt to suppose. In spite of the different g- 
Saturations of mechanical arithmetic and mathematics among re- 
Cruits (approximately -55 and -77, respectively), the correlations 
between these tests were always so high that a broad arithmetic- 
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mathematics group factor appeared unavoidable. Oldham (1937-8) 
claimed that Arithmetic, Algebra and Geometry yielded separate 
group factors among secondary and central school pupils, but it is 
difficult to see how her figures substantiate this. Although she 
used tests specially designed to avoid overlapping between the 
subjects, she obtained a common factor covering, on the average, 
57 per cent. of variance, very little of which was attributable to g 
(cf. also Wilson’s results, p. 39). More detailed testing could no 
doubt transform the s-factors for Algebra, Geometry, etc., into 
group factors; but they would still be quite small. In other words, 
pupils with special flairs for, or special disabilities at, particular 
branches of mathematics are rather rare. What Oldham’s figures 
did demonstrate, however, was the great amount of variation in 
correlations in different school classes, which could be ascribed to 
thé way the subjects were taught, and to the interconnections that 
teachers had established in their pupils’ minds. Effects of teaching 
on the structure of educational abilities was shown too in a study 
of 500 naval Air Mechanics, who took an ordinary school mathe- 
matics examination on entry to a training establishment, and, a 
few months later, an exactly parallel ‘progress’ test. They were 
coached in similar problems throughout the period. Both exami- 
nations were factorized along with naval tests, and the entry one 
resembled the mathematics part of Test 3 in its loadings, the 
Progress one resembled the arithmetic part. The coaching had 
transformed the ability from the ‘reasoning’ to the ‘rote’ type. 

An interesting point established in Sutherland’s (1941) re- 

` search, mentioned above, was that the familiar or unfamiliar set- 
ting of arithmetic problems does not affect their factor content. 
According to his rotations, tests involving familiar or unfamiliar 
Situations obtain g, v (i.e. general educational) and z loadings all 
close to “5, together with small loadings (+3) on his tentative 
induction’ factor. No investigations, to the writer’s knowledge, 
have shown any differentiation between mental and written 
arithmetic, or between money or other types of sums. 

Another research which attempted to define the essence of 
N was that of Coombs (1941), who analysed thirty-four tests 
among 223 high school pupils. He included several tests based on 
letters of the alphabet or shapes, which were designed to measure 
the same kind of functions as arithmetic tests. Actually their N 
loadings were all close to zero, showing that the ability is specific- 
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ally related to numbers. Nevertheless, they did bear out the 
hypothesis that N involves the application of a set of highly stereo- 
typed and practised rules. Tests based on shapes, i.e. relatively 
unfamiliar symbols, gave even lower saturations than those based 
on letters. Another conception, that of serial response, did not 
seem to be crucial; for it was actually the simplest two, three and 
four digit addition tests which had the highest loadings of all. 
However, Guilford and Lacey’s work indicates that, in high-grade 
groups such as aircraft pilots, the tests which give the purest 
measures of N must not be too mechanical. For they report higher 
saturations for subtraction and division than for addition and 
multiplication sums. This is borne out by the experience of 
British Army psychologists. 

Reading Ability or Abilities. As in arithmetic, so in English 
there are no definitely established classifications of sub-typeS of 
ability. We talk of literacy and illiteracy as though they constitute 
a general factor in all verbal subjects, and very likely we are correct. 
But we do not even know how far the reading and writing com- 
ponents of literacy are distinctive, nor whether creative com- 
position, knowledge of grammar, sentence structure and punctua- 
tion, or spelling are separable components of writing (cf. Vernon, 
1949b). Harris (1948) lists correlations between fifteen reading, 
writing and English measures in four groups of about fifty 
American Indian pupils. ‘But they are far too irregular for any 
definite factors to emerge other than a strong g ar v:ed one 
throughout, and possibly a contrast between the reading tests on 
the one hand and the usage and composition tests on the other. 

In the field of reading many different batteries of tests have been 
published. Each author analyses the total complex of reading 
skills into different a priori components, for none of which is any 
empirical justification offered. For example, Burt and Schonell 
provide tests of word pronunciation, continuous prose, speed and 
comprehension. Gates’s tests for Grades 3 to 8 claim to measure 
Reading to Appreciate General Significance, to Understand 
Precise Directions, to Note Details, etc. Trigg’s series for Grades 
7 to 12 includes Vocabulary, Visual and Auditory Comprehension, 
Rate of Reading three types of material, and two tests of Word 
Attack, Hall and Robinson (1945) state that the correlations be- 
tween Gates’s tests, which are supposed to measure distinct skills, 
are as high as those between tests by different authors which are 
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supposed to measure the same skill. When moderate or low 
correlations are found among tests aimed at different aspects of 
reading, this is often due merely to the unreliability of the tests, 
and not to any real distinction between those aspects. 

Mechanical Reading. The existence of only moderate cor- 
rélations between mechanical (e.g. word-pronouncing) and silent 
reading or comprehension tests is fairly well established; but we 
have already suggested that this may be due mainly to the higher 
&-content of the latter. Vernon (1938) carrelated several reading 
tests and teachers’ marks among Scottish primary school pupils, 
and found his Graded Word test to have a higher reading factor 
saturation than either a comprehension or a speed test. Within 
the field of mechanical tests, there is some evidence that recogni- 
tion tests differ from pronouncing ones. ‘Thus Dunlop (1942) 
reports a correlation of -83 between Vernon Word Recognition 
and McLaren Word and Picture Matching tests, among 6-year 
children, but correlations of -64 and -67 between these and the 
Burt-Vernon Graded Word tests. There is as yet no justification 
for subdividing oral reading into speed vs. accuracy factors, or 
into ability with regular phonic and irregular words, or into 
pronunciation of isolated words vs, complete sentences, though 
these are all possibilities. 

Silent Reading. American investigators seldom use mechanical 
(individual) tests, but are more interested in differentiation among 
silent reading tests. There is fairly strong evidence for partially 
distinct speed of reading, vocabulary or word knowledge and 
comprehension of sentences and paragraphs, factors at the high 
school and college levels, though at the same time there is a strong 
general factor. Gates (1921) quotes correlations for several groups 
of children (eight to fourteen year) in a single school for four 
reading comprehension te: 
bulary, and a group intelligence and a directions, tests, The 
averaged figures, shown in Table VI, indicate that all types 
measure much the same thing at this level. Thus different 
comprehension tests correlate more highly with rate measures 
(from other tests) and with intelligence than they do with one 
another. However, both rate and vocabulary show some specific 
overlap, i.e. they constitute partially distinct group factors. And 
another test (omitted here) based on reproduction of material read, 
Save very low correlations with the rest. 


sts, three rate tests, one oral, two voca- ` 
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Hall and Robinson (1945) claim to have separated speed, 
vocabulary and accuracy in a study of 100 college students, to- 
gether with an independent factor of ability to read and under- 
stand charts and tables. And they criticize reading tests which 
commonly involve a mixture of factors. Langsam (1941) similarly 
factorized twenty-one tests among 100 17-year students and ob- 
tained five factors which she identifies, not very convincingly, with 
Thurstone’s V, P, W, Nand J. The perceptual factor covered most 
of the speed of reading,tests, W the vocabulary tests, and J the 
tests involving logical organization and selection of ideas. Her 
first, general V, factor had about twice the variance of ati the other 
reading factors combined. 


e 
TABLE VI. AVERAGED CORRELATIONS BETWEEN DIFFERENT 
TYPES OF READING AND INTELLIGENCE TESTS (GATES, 1921) 


Compre- Rate Oral Vocabu- Intelli- 
hension lary gence 


4 Comprehension tests GED ESS 50 -51 +59 


3 Rate of Reading tests 5m (659) ee s -50 
1 Oral Reading test. 
2 Vocabulary tests 


Intelligence and Directions 
tests 


Davis (1944) also attempted to sub-divide reading ability and 
criticized tests which involve a mixture of factors. From a survey 
of the literature he arrived at the following a priori components: 


(1) Knowledge of word meanings. J 3 
(2) Recognition of appropriate meanings for words in particular 


contexts. $ NI 
(3) Following the organization of a passage and identifying 
antecedents and referents in it. 
(4) Recognizing the main thought of a passage. 
(5) Answering questions that are directly answered in a passage. 
(6) Answering questions that are only indirectly answered. 
(7) Drawing inferences from a passage about its contents. 
(8) Recognizing literary devices used, and getting the tone or 


mood. s f 
(9) Determining the writer’s purpose and point of view. 
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Tests of these components were devised and given to 421 
students without time limits, in order to eliminate any speed 
factor. On factorizing Davis claimed to isolate two main inde- 
pendent factors—word knowledge (chiefly in 1) and reasoning 
(chiefly in 6 and 7), together with several smaller factors corres- 
ponding to some of the other components. However, Thurstone 
(1946) re-analysed the data and showed that all correlations could 
be well accounted for by a single general factor, a blend of word 
Knowledge and comprehension. e 

Gther Reading Factors. Gans (1940) finds that success in 
selecting reading matter for help in solving a problem is separable 
from comprehension. Feder (1938) claims that reading factual 
material for information, and reading for inference, are distinct. 
But, in common with many other investigators, he fails to show 
that these are consistent factors in themselves. Artley (1943, 
1944) reviews researches on reading tests in different fields of 
knowledge and concludes that although they show considerable 
overlapping, there are wide variations between pupils’ or students’ 
abilities in different fields, Here, too, the unreliability of the tests 
has seldom been properly controlled. Artley himself obtained a 
correlation of -785 between general reading vocabulary and voca- 
bulary in a special field—Social Studies. It is noticeable also in 
Hall and Robinson’s research, which included reading tests in 


Pupils or students’ knowledge. All their marks are likely to be 
greatly influenced by a general vocab 
ability at such tests (cf. the discussion 
Pp. 76f) 


Analyses of Educational Attainments 47 


Conclusions. No further factorial evidence seems to be avail- 
able regarding the practical subjects that Burt distinguished, 
though we shall see in later chapters that the k:m factor probably 
links up with scientific ability, and that there may be an aesthetic 
discrimination factor relevant to certain subjects. Fig. 3 attempts 
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to portray the main findings so far. Although the picture that it 
gives of mental structure is certainly an advance on Fig. 2, it still 
cannot hope to do justice to the complex interconnections of 
different subjects, and to their variations among groups of different 
educational levels, or taught by different methods. 

Note that g, X and other relevant personality, interest and 
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physical factors, together with ved are placed in a central ‘com- 
plex’ which constitutes general educational ability. This affects all 
branches of all subjects. The influence of particular interests, 
traits or physical conditions on particular subjects, could not be 
shown. V:ed subdivides into v and n, which branch into the vari- 
ous linguistic and mathematical-scientific subjects. Each such 
subject, it may be assumed, would yield its own small group 
factor if appropriately investigated. An attempt has been made 
to place the more specific attainments, those which are usually 
least“dependent on general educational ability, furthest from the 


centre; also to place furthest apart those attainments which tend 
to show the lowest correlations. 


iN 


CHAPTER V 
INTELLECTUAL FACULTIES 


Abstract. The general conclusion of this chapter is that, though 
small group factors can be isolated fairly easily in many types of 
specialized cognitive tests, no intellectual faculties befond g and v 
are yet established as having much educational or vocational im- - 
portance. W (word fluency), F (ideational fluency), I or L (in- 
ductive or logical reasoning) and many other minor factors have 
been partially separated in selected groups, though often coalescing 
into g + v in other researches. The f group factor may be linked 
with certain personality trends. If there is a separate imaginative 
or creative ability, it seems impossible to measure it. The dis- 
covery of other abilities may well be stimulated by clinical re- 
search with mental patients, but so far there is no evidence that the 
commonly used deterioration and other clinical tests measure new 
factors. Apart from rote-memorizing and attention-to-directions 
group factors, memory and attention must also be regarded as 
unsubstantiated faculties» 

Introduction. Most modern psychological textbooks avoid 
using such terms as memory, attention, imagination, reasoning, 
etc., as though these constituted separate faculties. But we have 
seen that they still bulk largely in the layman’s and the education- 
ist’s discussion of human mental make-up. Very commonly, for 
example, objections are raised to objective or new-type attainment 
tests because they are said to measure memory for fatts, and fail 
to bring out the ‘understanding of principles, or the capacity for 
applying knowledge, or the originality and constructiveness, 
which find expression in the old-fashioned essay examinations. 
Similarly these tests and ordinary timed verbal intelligence tests, 
applied at eleven years, are said to select the ‘slick’ type of pupil 
for grammar school education and to handicap the pupil with more 
profound intellectual qualities who would make a better scholar. 
While it is by no means proven that none of these ‘types of mind’ 
or faculties exist, the evidence summarized below shows that they 
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should not be accepted until experimental.demonstration is forth- 
coming. Indeed the only thoroughly established ability of this 
kind, over and above g, is the v-factor, which we will consider first. 

V factor and its Sub-divisions. Verbal ability is usually in- 
extricably bound up with education, and impregnated with our 
X-factor. For example two representative groups of Army recruits, 
totalling 1,570 men, took a test designed to measure v, a test of 
clerical ability and an arithmetic-mathematics test; they were also 
assessed for education received. The inter-correlations of these 
four‘measures were all so high and so similar, lying between -753 
and -807, that they obtained almost identical factor loadings. 
Since the educational assessment was based merely on length of 
schooling, without any allowance for goodness of schools attended 
or for education obtained by private study, it would seem that v 
is determined more by upbringing than by any special linguistic 
aptitude (cf., however, p. 32). 

A number of American investigations do throw some further 


regard it as essentially verbal reasoning, though the Majority seem 
to agree that it is most effectively measured by straightforward 


less congruent, since it grouped the chief W tests under a ‘com- 
pletion ability’ factor. 


On the other hand Woodrow (1939b) foundno W, onlya single V, 
when he analysed fifty-two tests given to 110 students, although 
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several possible tests of fluency were included. Incidentally he 
also found that four of the five component tests of the George 
Washington Social Intelligence Test measure scarcely anything 
but V.1 The Division of Occupational Analysis’s (1945) list of 
factors also omits W; but some of its verbal tests obtained loadings 
on the perceptual and speed, as well as the V and general factors. 
The Spearman-Holzinger unitary trait study also yielded a single 
v among 13-year children, though some of the minor group factors 
such as imagination (7) or mental speed (a) may correspond te 
Thurstone’s W (Holzinger, 1934-5). Johnson and Reynolds 
(1941) analysed ten verbal tests and identified the two factors that 
emerged as flow of responses, and selection of correct responses, 
which appear to correspond to W and V. Thornton (1939) ñen- 
tions a fluency factor in his research into tests of persistence; but 
it seems to be merely the verbal and g element in the tests» he 
happened to use. 

More recently Thurstone (1948) has stated that three verbal 
factors should be distinguished, and suggests that they are selec- 
tively affected in different types of aphasia: ; 


V=understanding of verbal material. 
W=fluency in finding words to fit a restricted context. 


F=ideational fluency with words. 


This conclusion seems to fit in reasonably well with the several 
investigations of verbal tests listed below, though it is quite im- 
possible to reconcile them all. Since they involved extracting and 
rotating up to ten factors with populations of one to two hundred 
students, it would be hopeless to expect consistent results. 
Fruchter (1948) re-analysed the correlations of twenty of Thur- 
stone’s original tests and found, in addition to y, W and the other 
primary factors, a ‘speed of calling up pertinent associations 
factor in Inventive Opposites, Contrélled Association and Com- 
pletion tests. Taylor (1947) describes four verbal factors: 


Verbal comprehension (in Sames-Opposites, Sentence Com- 


pletion and Mixed Sentences tests); l ; 
Verbal versatility, or ability to express an idea by several differ- 


i i is battery and extra- 
1 The writer has found moderate overlapping between this x 
version-introversion questionnaires. Possibly this connects with Cattell’s 
(1936) statement that fluency tests give good predictions of ‘surgency’. 
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ent combinations of words (in Similes, Making Sentences 
from Initial Letters, Completing Unfinished Stories); 

Word Fluency, involving no reference to the meaning of words 
(in Anagrams, Words ending with ‘tion’, Words beginning 
with S, etc.); 

“ Ideational Fluency or production of words through meaningful 
associations (in Writing Themes, Names of Things that are 
Round, Adjectives to Describe a House, etc.). 

Finally Carroll (1941) discovers eight verbal factors, though one 
of these thay be largely motor (Maximum and Normal Speed of 
Oral Reading), and another is concerned with coherent oral ex- 
pression. He narrows down V to ability to learn and retain conven- 
tional linguistic responses (e.g. Grammar, Vocabulary, Spelling, 
Rkymes), and separates a Verbal Relations factor (Recognition of 
Roots of Words, Rearrangement of Syllables), and a third some- 
what amorphous factor. W he splits into A—speed of word 
association in a restricted context (e.g. Colour Naming, Suffixes, 
Dis-arranged Words), E—rate of production of syntactically 
coherent discourse (e.g. Number of Words in a Written Theme, 
Number of Relevant Words in Describing a Picture), and a third 
Naming factor or ability to attach appropriate names to stimuli, 

Oral Fluency. There seem to have been few factorial studies 
of speech characteristics apart from: Carroll’s. Gewirtz (1948) 
applied a number of oral fluency tests to thirty-eight 5-64 
year children, and suggests that there are d 
abilities in restricted and unrestricted contexts. 
relations between tests of his first type (Finding Rhymes, Words to 
Follow ‘In the —, Children’s Names, etc.) appear to be wholly 
attributable to g + v as measured by Stanford-Binet and a Mixed 


ifferent fluency 
Most of the cor- 


Gregariousness, Competitive- 
negatively with Social Appre- 


Breaves (1927), Cattell (1936), St 
etc. They have been interested 
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trait, which differentiates various types of mental patients, or 
which may be related to the popular conception of extraversion, 
rather than as an ability. The tests generally used for measuring it 
include Numbers of Words Beginning With—, Names of Animals, 
Numbers of Four-letter Words, Inkblot Responses, Associations 
with Pictures, Writing Themes, Normal Speed of Reading, efc. 
Several of these use non-verbal stimuli, though the responses are 
verbal, and. Taylor (1947) suggests that they are factorially com- 
plex. Certainly they correlate positively, but Hargreaves doubted 
whether they embody any common factor over and above g, spsed, 
and memory group factors. Holzinger (1934-5) included a number 
of them in the unitary trait study, and found that they resolved 
chiefly into g and v in one group of children. In other analyses he 
does claim a small distinctive imagination group factor ©. 
Another possibility which requires investigation is that W derjves 
merely from the simpler and more highly drilled, less g-saturated 
aspects of verbal ability, i.e. that W= V-g. : 
Creativeness. How about an imaginative, creative or construc- 
tive faculty? So far our evidence suggests that, apart from g and v 
there is nothing but this rather dubious fluency factor, or set of 
factors. An aesthetic discrimination or literary taste factor is 
mentioned below (p. 93), though this too appears to be very small 
when g and v are held constant. Knowledge of grammar, punctua- 
tion and sentence structure might constitute another partially 
distinct aspect of v. If all these were measured it is doubtful 
whether any further creative factor in the writing of English could 


be isolated. 3 

Obviously this answer will not satisfy teachers, but the difficulty 
hat it is almost impossible to 
The disagreement between 


(as opposed tothe more 


in distinguishing such a factor is t 
Measure creativeness reliably. 
examiners when marking the ‘higher’ (as A 
factual) qualities of essay papers is notorious. True, the sie, 
summarized in Chapter VIII shows that "some consensus © 
opinion as to the aesthetic merit of literary, visual or musical on 
Positions can be established, but no one has yet rae ss s 
method of investigation to pupils’ or students’ compositions. 4 hey 
are still marked mainly for g, v and knowledge factors. 

It is desirable here to distinguish between the pedagogical and 
Psychometric viewpoints. The writer entirely agrees that the A 
clusive use of objective intelligence and attainments tests for 


E 
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selection at 11+ has had a most undesirable backwash, stimulating 
teachers to confine their instruction to the sort of problems set in 
these tests. Everything possible should be done to encourage 
creative work, including the writing of continuous prose, and to 
develop inquiring and reflective attitudes of mind. But it is a 
cifferent matter to claim that any generally recognizable quality of 
originality exists and can be marked or measured, when it depends 
so largely on subjective taste. 

= We also lack any proof that such a quality is really relevant to 
secondary school work. The writer is very willing to be converted 
by experimental evidence, but doubts whether predictions which 
take creativeness into account would be any more accurate than 
predictions based on objective tests + ordinary school marks to 
cover the X factor. 

Reasoning. Spearman himself admitted specific overlap, or a 
small group factor, in logical reasoning, though this was based only 
on two tests, given to sixty-three students, and so need not be 
taken very seriously. Thurstone’s primary factors included one 
main type of reasoning denoted as J (Induction), and two more 
uncertain factors— D (Deduction) and R (Reasoning). In younger 
groups these appeared to amalgamate, and Thurstone’s followers 
such as Davidson (1945), Taylor (1947), Fruchter (1948) and others 
have usually found only one. Now reasoning ability is one of the 
commoner definitions of intelligence, and we would therefore ex- 
pect, if we allow a g factor, that g would include the whole of the 
variance of reasoning factors, together with part of that of V, N, 
S, etc. Both Holzinger and Harman (1938) and Eysenck (1939) 


show that this is the case, and that in only two tests out of Thur- 
stone’s fifty-six is there some s 
termed logical 


Analogies group factor, 


firmed in another experi 
induction and deductio. 
L. L. and T. G., 194 
(1939) claimed a logic: 
tests, but her method 
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describes a bipolar factor separating ‘Reasoning’ from ‘Appraisal’ 
tests in more than one investigation, but he does not give sufficient 
information about the tests or correlations to enable us to assess its 
Status. ; 

More convincing is the Division of Occupational Analysis’s dis- 
covery of a Logic factor in several verbal and non-verbal tests 
involving formal rational solutions to problems, in addition to their 
general factor content. But it is clear that the investigators rotated 
their factors in such a way as to minimize g. In the absence of 
published figures, we cannot tell whether a different set of réta- 
tions might not have absorbed most of this into g. “Since the 
General Aptitude Test Battery issued by the Division omits this 
factor, it is apparently regarded as having no vocational importance. 
The same may well be true in the educational field. A small 
reasoning or logic group factor could be isolated from specialized 
tests, but it would be unlikely to add anything to measures of g, v 
and x in the prediction of the reasoning ability desirable among 
secondary pupils or college students. 

U.S.A.A.F. Investigations of Reasoning. The most elaborate 
work is that of Davis and Guilford among U.S.A.A.F. candidates 
and trainees. Davis (1947) tried out fourteen tests designed to 
measure ‘practical judgment’ on 150 high school boys. Tests with 
mechanical or spatial content were, as usual, differentiated from 
verbal ones, but the latter appeared to yield separate group factors 
of: 

(1) Logical reasoning in Syllogisms, Reading or Arithmetic 

Problems. 


(2) General vocabulary. 3 
(3) Pure judgment and reasoning judgment. 


It is difficult to discern the difference between (1) ard (3), but 
Davis suggests that the latter requires the, calling to mind of 
pertinent information, in other words that Judgment involve 
something in the nature of Fluency. In default of analyses along 
with g and W tests, it seems unsafe to draw any conclusions. Davis 
stresses the complexity and the number of independent factors 
involved in reading, but this may be due merely to his use of 
Kelley’s factorial technique on highly unreliable tests.! Actually 


1 The Principal Axes technique inevitably turns every specific factor (includ- 


ing error variance) into an independent common factor. 
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the average correlation of tests falling within groups (1) and (3), 
namely -249, is scarcely any higher than the average of -223 
between these groups. 

Guilford and Lacey’s (1947) list of reasoning factors includes 
the following, in addition to V, N, mechanical, spatial and other 
influences in certain tests: 


General reasoning in mathematics tests and several verbal 
judgment and non-verbal tests. i 

Reasoning II in non-verbal analogies and Gottschaldt Figures. 

Reasonisg III in spatial reasoning tests and decoding (cypher) 
tests, 

Planning of routes on maps, through mazes, and in electrical 
circuits. 

Judgment in practical situations, making common-sense deci- 
sions. 


They point out that none of these conforms to logical categories 
like Thurstone’s Induction and Deduction. Considerable caution 
is needed in evaluating Guilford’s factors, since many of them were 
established only in smallish groups (2 to 300) of highly selected 
aviation students, and are of doubtful reliability (cf. p. 131). It is 
noticeable that in the larger analyses with populations of several 
thousands (which admittedly included few reasoning or judgment 
tests) only a single General Reasoning factor emerged, and even 
this tended to overlap with V. Only one analysis was carried out 
on a relatively unselected group of 689 high school boys. This was 
re-analysed by the present writer, using group-factor technique, 
and it was found that a g (with some 22 per cent. variance) ++ 
group factors for verbal, mechanical and spatial-perceptual tests 
(totalling some 12 per cent.) gave almost as good a fit as Guilford’s 
six multiple factors—V, Mechanical, Visualization, General 
Reasoning, Reasoning II and Judgment. 

Reyburn and Taylor’s Experiment. One investigation which 
appears at first sight to provide crucial disproof of a single g and 
evidence for several distinctive intellectual faculties, is that of 
Reyburn and Taylor (1941). Ten varied intelligence tests were 
analysed among 1,497 South Africans, aged 12-18, and although 
all correlations were positive, Many were so low that the 
only statistically satisfactory solution was a set of five multiple 
factors. The content of these was very puzzling, only one—a 
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verbal factor—appearing to coincide with the findings of other 
investigators.. Yet with so large a number of carefully tested sub- 
jects, we cannot afford to ignore this. The writer would suggest 
that the wide age range has distorted the coefficients, and is 
responsible for their generally low level. For example an Absurdi- 
ties test and Porteus Mazes which would normally correlate $o 
about -4 in a twelve-year group, correlates here only -047, prob- 
ably because ability at Absurdities may go on increasing up to 18, 
whereas ability at Mazes,may reach a maximum and even begin te 
decline before 18, Another disturbing condition is that about half 
the group was English-speaking, half Afrikaans. The correlation 
would be reduced if the former were superior in Absurdities, the 
latter in Mazes. $ 
Factors in Mental Deterioration. A case might be made out 
for a factor distinguishing what Cattell (1943) calls fluid and gry- 
stallized abilities. The former is an individual’s effective intelli- 
gence at new problems, which declines with age and is reduced by 
brain injury and other pathological conditions. The latter consists 
of long-established discriminatory habits which are less, or not at 
all, subject to deterioration. Since, however, the most representative 
tests of the former type are those most saturated with g such as 
Matrices, Abstraction, Wechsler Similarities, etc., whereas tests 
of the latter type include Vocabulary, Information, Comprehen- 
sion and Arithmetic, there do not seem to be sufficient grounds for 
positing any new factors beyond g, v:ed and (as will be shown 


in Chapter VII) speed. Indeed, susceptibility to deterioration 


might provide us with a useful external criterion, beyond the 
purely statistical one, as to which tests are most representative 
of g. 

N europsychiatric Faculties. This brings us to a thorny topic. 
What do the innumerable clinical tests employed by neurologists, 
psychiatrists and clinical psychologists measure? Neuropsychiatric 
literature is extremely prone to talk of such faculties as memory, 
concentration, conceptualization and orientation, and to state that 
one or more of these is affected by organic disorders, lesions or 
psychoses. Numerous unstandardized tests are employed in 
addition to scientific scales like the Wechsler-Bellevue, for example 
Serial Sevens, Bender Visual Gestalt, Vigotsky Blocks, Repeating a 
Story in Own Words, Naming Six Large Cities (cf. for example, 
Curran and Guttmann, 1945). It is argued that even though many 
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of these may not yield accurate measures of established factors, 
they provide valuable clinical insight into the patient’s disturbed 
intellectual functions. In the writer’s view (Vernon, 1949c), re- 
search with such tests should have considerable exploratory value 
in suggesting distinguishable mental capacities, but it should be 
followed up by factorial studies like that of Thurstone in the field 
of perception (p. 89). The two approaches should be comple- 
mentary. The argument that every patient is a different individual 
who cannot be measured and fitted into a neat factorial framework 
is dubious. For if it is true, then the clinician should not talk about 
conceptualization, concentration, etc., as these also differ in every 
individual. If he is going to use such concepts in describing large 
numbers of patients, their distinctiveness and consistency must be 
objectively demonstrable. There can be no doubt that many such 
testy measure nothing but g and v and valueless specifics. At the 
same time the factorist may have much to learn from the clinician. 
Many syndromes such as impaired retentivity for recent events, 
the decreased ‘insight’ of the paranoiac, etc., are thoroughly estab- 
lished and should provide pointers to factors which would be worth 
measuring (cf. Hsii, 1948), 

One attempt to analyse clinical tests is that of Halstead (1945, 
1947). He claims that thirteen tests given to fifty patients yield 
four distinctive aspects of intelligence, or else (applying Holzinger’s 
group-factor technique) one general and three group factors. The 
general factor which he calls C (Central Integrative) obviously 
corresponds to g, being highest in a group intelligence test, an 
Abstraction test, Speech Discrimination, etc. But it seems doubt- 
ful whether the other factors are Statistically significant. They have 
no clear psychological Meaning, and Halstead fails to show that 


they are differentially affected by different psychopathological 
conditions: 


Other Analyses of Intellectual Qualities. An original view of 


intellectual qualities has been put forward by Meili (1946; cf. also 
Myers, 1947), Though using Thurstone’s centroid technique, his 
factors are somewhat uncertain, being based mainly on individual 
tests given to several small groups of subjects (thirty to fifty) of 
of various ages. He rejects g, and is not concerned with such 
factors as V, N, or S, which he regards as arising from the external 
characteristics of the tests. Instead he finds four main factors 
which are aspects of, or together make up, intelligence, namely: 
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(1) Plasticity: the breaking down and reorganization of struc- 
tures. This seems to resemble Thurstone’s second per- 
ceptual factor closely (cf. p. 89).. 

(2) Complexity: ability to realize complex intellectual structures. 

(3) Fluency. This corresponds to Thurstone’s unrestricted or 
ideational F factor. Fi 

(4) Globalization: uniting separate data into a single whole, an 
essentially creative capacity. > 


It will be interesting to see whether these can be confirmed”by 
more extensive investigations. 4 
One other tentative approach to the analysis of intellect which _ 

deserves mention is that of Earle (1948). He has not used factor 
analysis, but has compared the results of numerous intelligence 
and other tests given at 10 to 13 years with the subsequent per- 
formance of the:children at various types of secondary school 
course, and so arrived at a working classification. Though re- 
garding g as the main source of individual differences in scholastic 
abilities, he considers that the following sub-types or group factors 
tend to differentiate, probably under the influence of interests and 
temperament. 


(1) Knowledge of words and comprehension of sentences. 

(2) Logical reasoning, seeing relations between objects 
ideas, recognizing and describing attributes of persons or 
things. 

(3) Seeing relations between, an’ 
(i) numbers, (ii) shapes. ; 

(4) Comprehending the structure and functions of shapes, 
mechanisms and other objects; dealing with practical 
problems. ə 


Clearly Nos. (1) and (4) correspond to v andk:m factors. No. (3) 
links k and n. This seems doubtful at so early an age, though it 
does occur among older pupils and students (cf. p. 73). No. (2) 
is the faculty which has been criticized in this chapter. A large- 
scale analysis of Earle’s Duplex tests, which yield scores for g and 
for these types of ability, is badly needed. : 

Memory. One of the most popular faculties, both among 
educationists, psychiatrists and laymen, is memory. Spearman 
(1927) admitted that g enters into many learning and reproductive 


and 


d carrying out operations with: 
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activities, but regarded retentivity as an entirely distinct mental 
function. Moreover, he did not expect overlap between retentivities 
for different materials, except in so far as the materials were 
closely similar. Ingham (1949) gives a valuable review of the com- 
plexities of measuring learning ability and retentivity, and con- 
cludes that Spearman was not far wrong. But g does enter into all 
memory activities, particularly when the material is meaningful, 
and a fairly broad rote memory factor can be recognized in 
addition. =è $ 

As early as 1920, Smith and McDougall found correlations of 
-53 between two tests of logical memory, and -61 between two of 
habit memory, but coefficients around zero between tests of these 
two types. A memory for prose test partook of both types. They 
claimed thus to substantiate Bergson’s distinction between habit 
and meaningful memory. ‘These results were obtained with 
forty-one students only and they have not been confirmed, but it 
is likely that the correlation between their logical tests represents g, 
and that the overlap between their habit tests is due to a separate 
factor. Several subsequent researches cast doubt on the existence 
of any memory factor or factors. Thus in Holzinger’s unitary trait 
study (1934-5) the memory tests resolved into g,v and a (mental 
speed). But ina later publication Holzinger (1938) demonstrated a 
group factor in immediate memory span for words, sentences, 
digits and pictures. Eysenck and Halstead (1945) applied fifteen 
of the commonly used clinical tests of ‘memory’ to sixty mental 
hospital patients, and found that a single factor, identical with g, 
accounted for the whole of their overlapping. However, this result 
was at least partly due to the unusual heterogeneity of their patients 
ing. Again, Bryan (1934), working with 200 kindergarten children, 
found as close correlations between eleven memory tests and 
Stanford-Binet and Vocabulary as among the memory tests 
themselves. ° i 

Nevertheless a large number of investigators have obtained clear 
rote memory group factors, including Thurstone, L. L. (1938a, 
1940), Thurstone, L. L. and T. G: (1941), Woodrow (1939b), 
Carroll (1941), Wittenborn (1943), Taylor (1947), etc. Most of 
these factors were based on digit or sentence span, paired associate 
or recognition tests, none of which bear much resemblance to 
learning, retention, or recall in everyday life. Kelley’s (1928) 
factor, found among 13-year, 9-year and kindergarten groups, 
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extended to verbal, numerical and visual material. But the tests for 
each type of material were exactly similar in form. Words, num- 
bers or pictures were shown to the children, who then picked out 
which items they had seen on their answer sheets. Anastasi (1930, 
1932) obtained a small but significant group factor in several paired 
associate tests and in recognition tests of previously presentêd 
words, syllables and forms. But as soon as she tried to extend this 
to learning and retention of logical material, delayed memory for 
words, reproduction of movements, and Seashore’s tonal memory 
test, the correlations mostly became negligible. Since she worked 
with somewhat homogeneous groups of college students, none of 
the tests except those for logical memory showed appreciable cor- , 
relations with a verbal g test. In the U.S.A.A.F. also rote memory 
factors were readily established, but a delayed memory test for 
logical and visual material showed no loadings on these factors, 
only on V and Visualization (k). . 

The broadest factor so far described is that of Ingham (1949), 
who gave eight paired associate tests, individually, to eighty Army 
recruits, and included nonsense and meaningful words, pictures 
and forms. Each test was scored in four ways, for immediate 
memory, for speed of learning, for retention after thirty minutes 
(given a constant amount of initial learning), and for time saved in 
relearning. A single memory factor, as well as g, ran through all 
these scores. The average variance of both factors was 12 to 
13 per cent., but g was more prominent in learning and immediate 
memory scores, and the memory factor more important 1n reten- 
tion and saving scores. 


Some sub-divisions of rote memory have been discovered. Thus 


Guilford and Lacey found different factors among paired associ- 
ates tests and tests involving study and immediate recall of details 
on maps. The Thurstones (1941) state that the factor ifvolved a 
tests involving temporal sequence differs from that in oe 
associates. Carlson (1937) attempted to study rote vs, logical, a 
visual vs, verbal memory factors. But as his material Cart’ 
solely of word-recognition tests, the general factor E which 
he arrived and the sub-factors for ‘vocal’, ‘visual and sivas 
tests can only be accepted as specialized rote factors. Similarly 


Brener (1940) factorized seventeen tests of memory Sen anon 
forty stud tained, in addition to a prominent genera 
shake f which seemed 


factor, some rather doubtful group factors, one 0 
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to be mainly verbal, another visual or spatial. Some further work 
on sensory memory tests is described in Chapter VIII. 

Attention. Finally we must consider the evidence for a faculty 
of attention or concentration. At one time Burt (1909) and Mc- 
Queen (1917), finding that tests which demanded the greatest 
attentiveness tended to have high g-saturations, concluded that g 
and attention are identical. Holzinger (1934-5) did discover a 
group factor (¢), over and above g in certain tests involving listen- 
ing to the experimenter and following directions. For example, the 
tester reads ‘D 23’ and the subjects count mentally and write down 
F I, the second and third letters after D. He admits that it was the 
smallest of his group factors, and the most irregular in his different 
groups of subjects. Woodrow ( 1939b) used oral and written direc- 
tions tests, and found a distinctive factor in these and in tests of 
arithmetic, cancellation and copying figures off a chart. This would 
appear to bear a close resemblance to Thurstone’s perceptual speed 
(clerical) factor, discussed later, However, Wittenborn (1943), in 
an investigation with 175 Air Corps trainees, included several 
complex oral directions tests demanding sustained concentration. 
For example, lists of digits were read out (on phonograph records), 
testees having to write a cross if the first in a list was the largest, 
the second the smallest, or to make appropriate responses to odd 
or even numbers. Other tasks were based on lists of vowels and 
consonants. These tests gave a promirent factor distinct from P, 

¿Sand N. How far it involves g and v was not indicated. 

Much the same concept is implied by Guilford and Lacey’s 
work on ‘integration’. They claim not one, but three integration 
factors in tests where subjects have to learn a number of rules and 
hold these in mind while deciding the appropriate responses to 
problems. Thus the best test of the 71 factor shows aircraft carriers 
flying various flags, and the carrier from which planes should take 
off depends on the numbers of flags, the directions of the ships 
relative to the wind, and other instructions. [2 factor appeared ina 
printed directions test and other tests involving quick adaptation 
to new instructions. 73 occurred in certain planning and reasoning 
tests where numerous considerations had to be integrated. It 
appeared to represent breadth rather than strength or flexibility of 
attention. While we can admire the ingenuity of the tests devised 
by USAAF, psychologists for measuring the mental traits con- 
sidered important in aircrew, and the care with which they were 
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validated, we can legitimately ask for confirmatory evidence from 
less highly selected groups before we accept these as distinctive 
and practically useful group factors. 

Many teachers appear to think that most of their troubles would 
be ended if they could develop their pupils’ attentiveness. Prob- 
ably however what they mean by attention can be resolved into: ə 

(a) g +v or in other words the children’s mental ages. For it 
is well known that lack of concentration is most prominent in 
younger classes. s a 

(b) the X factor, which includes the pupils’ interests in edch 
school subject. As already pointed out this is a function of the 
school or teacher as well as of the pupil, and wandering of attention 
should often be attributed to the teacher’s failure to make a subject j 
sufficiently interesting rather than to the pupils’ traits. 

(c) a relatively tiny group factor such as that isolated by Hol- 
zinger, Woodrow and Wittenborn. It is conceivable that their 
tests might give valuable predictions of educability, over and 
above g and v-ed tests, but the only evidence so far is negative 
(Wittenborn & Larsen, 1944). More probably their attention-to- 
directions group factor is, like the rote memory factors, confined 
to too narrow a type of test to have much spread to other attention 


situations. 


CHAPTER VI 


VERBAL AND NON-VERBAL FACTORS IN 
INTELLIGENCE TESTS 

Abstract. All intelligence tests measure some group factor or 
factors based on their type of material, in addition tog and specifics. 
V :ed factor is very prominent in verbal tests, in Stanford-Binet and 
Terman-Merrill, but this enhances their predictive value for most 
educational and occupational purposes. Spatial or k tests are 
nowadays distinguished from intelligence tests, but there is no 
clear dividing line, and non-verbal & tests—abstract or pictorial 
—usually show a small spatial-perceptual component. At the same 
time the group factors are seldom sufficiently marked to justify 
using intelligence tests as measures of abilities other than g. The 
Stanford-Binet, for example, does not give reliable diagnostic 
indications of verbal, numerical, memory, spatial or other abilities. 

Research findings are often contradictory since the incidence of 
spatial or perceptual factors varies with the selectivity of the sub- 
jects and their sex, the ease of the tests, and with the pre-supposi- 
tions of the factorist. The theory that k does not differentiate till 
about 14 years is not borne out. K contrasts strongly with n in 
children and dull adults, but they tend to link up at high-grade 
levels, presumably because of scientific education. The only sub- 
factor branching off from k that is well established is perceptual 
speed in matching or identifying details of shapes or pictures. This 
may link with the clerical group factor—a sub-division of v, though 
both are rather unstable. Perceptual speed is not responsible for 
the non-g content of tests like Progressive Matrices. 


Most psychologists were 
verbal problems until the 


are usually reached through verbal sym- 
bols and concepts. Nevertheless it was realized quite early that 


performance at such tests was influenced by linguistic develop- 
ment and education, and many investigations such as Gordon’s 
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with canal boat children proved this. Hence scales of performance 
tests, and group tests based on shapes or pictures, were con- 
structed as early as 1917 in an attempt to give the non-English 
speaking or illiterate recruit, the deaf child and other verbally 
handicapped individuals, a fair chance. Kelley showed in 1927 
that an ordinary verbal group test measures to the extent of 90 
per cent. the same thing as a summed battery of scholastic achieve- 
ment tests. This might of course be attributed to the achievement 
tests depending over-much on g, particularly if the instructions 
and the form of the objective items are unfamiliar to the pupils, 
Hence correlations with attainment as measured by ordinary 
school marks are usually somewhat lower. But at least as likely an 
explanation is that intelligence tests involve much the same 
linguistic capacities as achievement tests. A study of conventional 
intelligence test batteries shows indeed that they often include 
vocabulary and sentence completion items, that is precisely the 
same kind of material out of which silent reading tests are com- 
posed. 

Recent research indicates that nearly half the communality of 
many group verbal intelligence tests consists of v rather than g, 
but that some types of test are less v-saturated than others (cf. 
Vernon, 1947b). Abstraction tests, for example, whose problems 
are based more on letters and on word forms than on meanings, 
seem to have a g-variance of,some 65-75 per cent., andv only 5 per 
cent. Number Series tests, again, have quite small n-loadings. 
Presumably the size of the group factor depends on the extent to 
which it is fostered at home or in school. Hence it is large in any 
test depending on comprehension of words and sentences. And it 
might become equally large if schools set out to train pupils in 
answering Abstraction and Number Series problems. There can 
be little doubt that the coaching for group intelligence tésts which 
is nowadays so common ‘at 11+ has-reduced the g-variance and 
increased the group-factor variance of such tests. 

Factors in Non-Verbal Intelligence Tests. Next the problem 
arises whether similar group factors are present in non-verbal tests. 
Here there is a vast quantity of somewhat conflicting evidence, and 
it will be far from easy to pick our way through it to clear-cut con- 
clusions. As already mentioned, Kelley (1 928) found a distinctive 
factor both among 13- and 9-year pupils in two tests involving 
memory for shapes and two based on turning shapes around 


¢ 
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imaginally. Even among kindergarten children, though the results 
were less definite, a spatial factor appeared in a test of memory for 
shapes and a simple formboard. The Thurstones (1938a, 1941, 
1948) included numerous spatial tests in their primary mental 
abilities investigations and obtained a factor which they call S, 
obviously the same as El Koussy’s k, even as early as 5 to 6 
years. It was most marked in tests involving imaginative mani- 
pulation of shapes. Both Eysenck and Holzinger and Harman, 
reworking Thurstone’s figures, agree that such tests measure gand 
kor S, the average variance of both factors being in the neighbour- 
hood of 25 per cent. 

At the same time the distinction between spatial and other non- 
verbal group tests is by no means as clear as El Koussy (cf. p. 17) 
believed. Tests such as Cube Counting and Paper Formboard 
appear to involve imagination of shapes, and have obtained large 
R-loadings in many experiments. Yet they were originally de- 
signed as parts of the Army Beta test for measuring intelligence 
non-verbally, and were included by Stephenson (1931) in the 
battery whose inter-correlations he attributed solely to g (cf. p. 17). 
Emmett (1949) recently reanalysed El Koussy’s figures and showed 
that several visual tests, together with mechanical tests and wood- 
work marks, have almost as high k-loadings as the original eight 
tests. Though Alexander (1935) and Drew (1947) accept Spear- 
man’s and Stephenson’s assumption that non-verbal g tests de- 
pend only on g, their results accord at least as well with the view 
that they contain a small spatial component. In numerous analyses 
in the British Services, the Progressive Matrices test and the 
National Institute of Industrial Psychology’s Group Test 70 
have obtained small k:m loadings (cf. Tables V, VII, IX). Prob- 
ably it is the spatial and not the mechanical aspect of k:m ability 
which is involved (cf. Table TX). Williams (1948), on the other 
hand, found distinctive verbal, mechanical and spatial group 
factors in his investigation of 12 to 14 boys, but his non-verbal g 

tests were loaded only with g: 

Spatial Factors in School Subjects. Another dubious point is 
whether k enters into geometry or other school subjects. From his 
extensive work in developing intelligence tests for the selection of 
college students, Brigham (1932) was one of the first to note verbal, 
numerical and spatial group factors in such tests. And he found 
much higher correlations between spatial tests and subsequent 
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performance in drawing (descriptive geometry and mechanical 
drawing) than between spatial tests and other subjects, or between 
verbal tests and drawing. Similarly Smith (1948), who used 
several of El Koussy’s tests and obtained results analogous to his 
among 100 13-year-olds, claims k-loadings in art, geometry and 
engineering drawing, though not in handwork, marks. On the 
other hand Blackwell (1940) and Holzinger and Swineford (1946) 
do not find any overlap between spatial tests and geometry, apart 
from g. The one point gn which all workers agree is that girls or 
women are poorer than boys or men in k, and the sex difference 
has been used as a sign that a test is measuring this factor. 

Reasons for Divergent Results. No doubt different methods _ 
of factorization and different arbitrary rotations account for Some’ 
of the discrepancies in the results of different investigators. The 
use of groups of subjects of differing degrees of selectivity valso 
probably plays a part. It is not unlikely, for example, that k is less 
differentiated from g among duller testees. Subjects of different 
sex, age, training and general ability may often tackle the same test 
items by different methods (cf. Thurstone 1938b). Thus many of 
the items in the Matrices test can be done largely by verbal logic, 
or by spatial imagery, and possibly even by visual matching (per- 
ceptual factor). Hence the irregular appearance of k in tests sup- 
posed to measure g is not surprising. ; 

Space Factor and Age, The role of the age factor has been 
stressed by Slater (1940, 1941, 1943), though again there is no 
general agreement. In a series of researches he applied verbal, 
non-verbal g and spatial tests to 82 and 211 children aged 
11+, 161 aged 13+ and 89 trade apprentices aged about 
18. In the third group there was a clear k-factor, which extended 
through almost all the spatial, and the mechanical, tests. But he 
claims that, in the younger groups, the only factor besides g isa 
verbal one, and that spatial tests measure just the same thing as 
non-verbal g tests. He concludes that k tests cannot be used at 
11, or even at 13, for selection for technical education. It 
should be recognized however that, just as in pea re- 
search (cf. p. 17), it would be equally legitimate to E as 
Separate group factor in the spatial and the oN : Dies : 
Although he obtained a good fit with two factors a i Sg tae 
to stop us extracting a third. Adcock (1948) has ae ar E 
both Thurstone’s multi-factor, and group-factor, tec d 


68 The Structure of Human Abilities 


finds clear v and k factors in addition to g. According to his results 
the non-verbal g tests, Matrices and Group Test 70 Parts I and 
IL, have some 7 per cent. of k-variance, whereas the definitely 
spatial tests have about 16 to 30 per cent. variance. Emmett (1949) 
similarly, after grouping together several of Slater’s tests, found 
three statistically significant factors, and some though not all of the 
non-verbal g tests certainly involved k. Another feature of Slater’s 
investigations was that his younger groups consisted half of girls, 
half boys, whereas his older group was all male. Several researches, 
described later, indicate that k is less differentiated in femiules than 
in males. : Emmett suggests also that most of the tests were not 
well suited to young children. j 

The evidence for a space factor around 11-13 and earlier, from 
El Koussy’s, Kelley’s, Thurstone’s and other investigations is indeed 
overwhelming. Kerr (1942), Dempster (1948), and Williams (1948) 
may also be cited. Peel (1949) gave nine tests to three groups of 
70-80 boys and girls aged around 11, 12} and 134, and in 
each group his second, bipolar, factor contrasted two performance 
tests with three verbal tests. A spatial test based on detecting 
faults in patterns, not on manipulation of shapes, approximated to 
the performance type, and two non-verbal g tests were usually 
intermediate between the verbal and practical-spatial. Emmett 
(1949) factorized four verbal and numerical, three non-verbal and 
two spatial tests among 178 eleven- and twelve-year boys. 
The spatial tests yielded a distinct factor, one which involved 


three-dimensional judgments gaining a higher loading (and 
less g) than the other which involved two- 


was smaller and rather irregular. 
g and k variances were 24-3 and 
12-1 among boys, 26-6 and 9-4 among girls. 

Pictorial Intelligence Tests, Though several of Mellone’s 
e.g. Mirror Images and Cube 
ow k-loadings were non-spatial 
» yet others which were more pictorial 
measure k as well as g. No in- 
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vestigation seems to have discovered a pictorial factor in pictorial 
group intelligence tests (though Burt suggests the existence of such 
a factor in certain performance tests; cf. p. 109). Most of the 
picture tests like Mellone’s, Otis Alpha, Cattell, etc., seem to be 
rather unreliable measures of g and to possess large specific com- 
ponents, or else to evoke a small amount of k. a 

Perceptual and Clerical Group Factors. Another possibility, 
which we must next consider, is the existence of a perceptual 
group factor, distinct from k—a specialized ability to solve prob= 
lems based on abstract diagrams, which might enter into such non- 
verbal g tests as Progressive Matrices. a 
i AP factor of Perceptual Speed was first described by Thurstone 
in tests involving rapid visual inspection and identification of 
letters, numbers, words and shapes. In a more extensive study of 
this factor (1938b), the most saturated tests included selecting 
common word associations, classifying words under headings 
(e.g. flowers, clothing, etc.), picking out the highest number in a 
column, and other tasks of a kind often included among clerical 
tests. Cancellation and arithmetic tests also obtained small load- 
ings. But non-verbal material of the same type seemed to be more 
saturated with S (k) than P. Other investigators, however, such as 
Dvorak (1947) and Guilford and Lacey have concluded that the 
factor is most prominent in tests involving the matching of pictures 
or shapes. In fact the Division of Occupational Analysis postu- 
lates two separate factors—Q in tests of a clerical type, simple 
arithmetic and coding, and P in matching figures or distinguishing 
slightly different pictures and shapes. A few tests such as com- 
paring lists of numbers partook of both P and Q, but there is as 
yet no evidence as to whether they are linked. s 

In a British Army investigation, a clerical factor was discovered 
as an offshoot of v:ed. An analysis of twenty tests among 300 
clerks yielded g, k:m, v and z factors with variances 28-3, 3-3, 5-4 
and 6-7 per cent., and a clerical factor with variance 7:7 per cent. 
in four clerical tests and in the A.T.S. spelling and arithmetic tests 
which involve the checking of right answers. Jorgensen’s (1934) 
correlations among some 150 college students likewise indicated a 
small group factor in spelling and in four clerical subtests, beyond 
gandv. But in more heterogeneous groups the factor seems to be 
absorbed into ved. For example, there was no sign of it in the in- 
vestigation of Table V, although three tests suitable for measuring 


F 
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it (together with two perceptual g tests) were included. It is note- 
worthy that in the Thurstones’ researches with fourteen-year-olds, 
P appeared only irregularly and tended to merge into Word 
Fluency or Space. Again in an analysis of sixteen of Thurstone’s 
Primary Mental Abilities tests by Goodman (1943), the tests 
meant to measure P resolved chiefly into V or S. 

A fundamental investigation by Thurstone into types of per- 
ception does not seem to throw any light on the nature of non- 
verbal P. It is described in Chapter VIII. But several other re- 
searches must be mentioned here. 

Other Investigations of the Perceptual Factor. Ten non- 
verbal reasoning tests were given by Blakey (1941) to 286 15-18 
year pupils and five factors extracted. There was no sign of a k 
factor, although several of the tests were spatial in character. The 
only clear factor besides g was one that differentiated four tests 
involving matching or identification of patterns and pictures, anda 
type of substitution test, from the rest. Probably the tests, and 
Blakey’s other factors were rather unreliable, and they may have 
been distorted by the wide age range. Thus no firm conclusion 
can be drawn here, except that non-verbal P and k are somehow 
connected. 

During the war, an investigation was made of fourteen tests 
among 500 candidates for naval radar which included, besides 
verbal tests, Matrices, Group Test 70 (all parts), an Oscilloscope 
Reading test involving rapid matching of oscilloscope pictures, 
Test 2 Mechanical and four spatial tests. The intercorrelations of 
these eight tests were almost wholly accounted for by g and k:m, 
but there was a slight contrast between the first three and the last 
five, which might mean either that Matrices, 70 and Oscilloscope 
involve less k than the others, or that they bring in a distinct P 
factor. As Matrices and Test 70 Parts II and III are given with 
fairly generous time limitsc(Matrices often with no limits), this 
might well obscure any factor of perceptual speed. However, Test 
70 Part I and Oscilloscope Reading are speeded tests, and these 
showed no residual correlation. 

Correlations between thirteen American tests for radar operators 
among 100 trainees are given by Lindsley (1943). An analysis by 
the writer suggests that, in addition to a prominent general factor 
and a group factor in several tests of graph reading, there is a 
distinctive perceptual factor in the Oscilloscope Reading test men- 
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tioned above and in three tests depending on identification of 
irregular visual shapes. 

In another analysis of R.A.F. aircrew aptitude tests, mostly 
borrowed from the U.S.A.A.F., a definite perceptual or observa- 
tional factor appeared in three tests involving matching of aerial 
photographs, of photographs and maps, and of aircraft silhouettes, 
and (to a smaller extent) in table and dial reading tests. It also 
showed some loadings for mechanical information and compre- 
hension tests and aviation information. This very wide content 
suggests that it derives mainly from aviation interests and 
A.T.C. training, rather than from any primary psychological 
aptitude. However, the same three matching tests form the basis 
of Guilford and Lacey’s perceptual speed factor, and in the 
U.S.A.A.F. it is linked, not with aviation interests, but with 
several tests usually presumed to involve k. No test of the clerical 
type had loadings of more than about -30 with this factor. 

The most relevant study is one where seventeen tests were 
analysed among 645 ground recruits—a group fairly representative 
of the general population. This included Test Obs-C—the 
aircrew test of matching silhouettes (which closely resembles the 
Division of Occupational Analysis’s P tests)—two tests of dial or 


scale reading, two non-verbal g tests, a clerical and three k tests. 


The results shown in Table VII do indicate the presence of a 
but its variance is so small (an average 
of 3 per cent. in seven tests, whose mean communality is 68 per 
cent.), that it cannot be regarded as having much statistical or 
practical significance. Ina subsequent group-factor analysis of the 
same battery, not yet published, it appeared chiefly in scale and 
dial reading tests, clerical and Obs-C. Though not present in 
non-verbal g tests, it was shown to affect a few of their easiest items. 
We may conclude then that there probably is a non-verbal per- 
ceptual or observational factor in certain visual matching or identi- 
fication tests, which may be fairly prominent among selected groups 
but is very small in unselected populations. It would appear to be 
an offshoot of, or to be linked with, both the spatial or k factor and 
the clerical sub-factor of v-ed. But it probably does not play any 
important part in non-verbal or perceptual g tests, particularly 
when these are of a power rather than a speed variety. 
Conclusions Regarding Intelligence Tests. In general, no 
test can claim to measure nothing but g (and error variance). The 


perceptual-clerical factor, 
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type of material used for expressing intellectual ability, whether 
verbal or non-verbal, always imposes some group factor, though 
this may be fairly small if the material is unfamiliar, as in verbal 
abstraction tests and visual tests of the Matrices type. Possibly 
pictorial tests are least likely to be biased either verbally or spatially 
but so far these have been constructed only for young children, 
and seem to be too unreliable to provide a promising source of g 


TABLE VII. ANALYSIS OF 17 TESTS AMONG 645 RAF. GROUND 
RECRUITS. (Rotated Centroid Factors; loadings less than -075 omitted) 


v Pere. k Mech 


Inf, 


Cajfculations test e ° 

Arithmetic test “62 67 “08 | - 
Test 119, Scale and Graph Reading] «88 -33 -87 
Test Ins-A, Dial Reading -80 +32 -74 


Test V-4, Verbal “67 +40 +40 -09 +78 
Gen-A1 and SP 14, Spelling tests *59 +31 +45 “65 
Gen-A2, Reading Comprehension | -59 -29 - 38 -14 11751363 
SP Test 21, Clerical e E Sy) *79 


Obs-C, Matching Aircraft Sil- 


houettes +59 +34 -30 56 
G-5, R.A.F. Matrices, and Pro- 

gressive Matrices *69 +27 SEKT] -66 
K-6, Spatial test "66 -18 *57 -80 
SP 4, Squares spatial °52 +16 sod oo 1551) 57 
Group Test 80, spatial 68 “09 +44 *66 


E A 
SP 117 E & M, Electrical and 
Mechanical Information sol S21 #13 1:55, | 362 
Mec-C, Mechanical information TrA bd "15 e S2 L78, 
Mec-B, Mechanical Diagrams *70 +08 *12 +42 | -69 
SP 122 Practical Problems *63 BZpT E H \ sos 


43:8 10:7 3-5 1:4 6-3 5:5 | 71-4 


Variance per cent, 


tests for dider children and adults. For most scholastic, and many 
vocational, purposes verbalgroup tests are far more useful than 
any non-verbal tests, because scholastic attainment is itself so 
largely a matter of v:ed as well as g. Non-verbal tests, whether 
pictorial or abstract, may be fairer to persons whose education has 
been disturbed, but they will seldom give as good predictions of 
educability or trainability. Their main use should be for research 
purposes, where it is desired to Separate off group factors from g. 

Stanford-Binet and Terman-Merrill Scales. Several analyses 
of these scales have been carried out by Burt (1939b), Wright 
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(1939), McNemar (1942b), Burt and John (1942) and Hammer 
(1948). These agree in showing that a general factor carries most 
of the variance, but that there are numerous small group factors 
whose make-up depends on the age level, i.e. the particular series 
of items, chosen. Most usually found are a verbal factor (e.g. in 
vocabulary), numerical (in digits, counting, and giving change), 
or alternatively an immediate memory factor, and a spatial- 
pictorial factor. It is doubtful whether any of these are sufficiently 
clear-cut or sufficiently consistent over a wide age range to justify 
testers in using the scales diagnostically, e.g. in claiming that a 
child is ‘good on the verbal side’, or ‘weak in memory®. So far no 
one appears to have analysed the scale among children with other 
reference tests. If this were done it would almost certainly be 
found that the general factor is partly composed of v. Thomson’s 
(1940) analysis of Stanford-Binet and eight performance tests 
among eleven-year-olds suggests that its g variance is about 
50 per cent. One might guess therefore that the rest consists of 
vzed 25 per cent., other group factors 10 per cent., specificity and 
unreliability 15 per cent. Alexander included Stanford-Binet in his 
factorial study of adult women, and analysed it into g 44 per cent., 
v 27 per cent., F (practical) 4 per cent., specificity 25 per cent. The 
group-factor content certainly differs at different ages, which 
means that the test does not always measure the same thing. 
Nevertheless it seems to be as useful as, or more useful than, any 
group test among children, partly because its g and v-ed content 
is so high, partly perhaps because it is less affected than group 
tests by artificial ‘formal’ factors (cf. p. 76). 

The Spatial Factor and Mathematical-Scientific Ability. 
Some further work on the space factor may be appended here. 
Among children and dull or average adults, k and 7 tests are 
strongly opposed, apart from g (cf. p. 34). But at high-grade 
levels an interesting alteration occurs (as Brigham seems to have 
realized in 1932). Both k and non-verbal g tests tend to link up 


with mathematical ability, and 7 becomes detached from the v-ed 


cluster. Probably this is due to the influence of science training 


both on mechanical-spatial and mathematical abilities. Thus 
among Vernon’s (1939) Training College students, Stephenson’s 
non-verbal g test correlated more highly with arithmetic and 
science subjects, a-verbal g test with education, geography and 
history. Similarly among Army engineering cadets, the corre- 
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lations of advanced mathematics and physics were greater with 
Matrices, while the more elementary arithmetic and algebra 
correlated more highly with the verbal Group Test 33. Finally, 
among 540 candidates for the higher Civil Service, the two 
factors shown in Table VIII bring out the main features of 
tet psychological tests, four examinations and two gradings of 
education. The first contrasts all the tests with the educational 
measures, while the second contrasts verbal tests and academic 
s a 
STABLE VIII. ROTATED CENTROID FACTORS FOR TESTS 


AND EXAMINATIONS TAKEN BY 540 CANDIDATES FOR 
THE HIGHER CIVIL SERVICE 


CISSB Test 70/1 


“55 “09 
Reading Comprehension -55 15 
Verbal Fluency "24 -58 
Current Affairs “15, 37 
CISSB Test 6, Verbal <12 -43 
Qualifying Intelligence Test “47 -40 
Qualifying Verbal -40 24 
Qualifying Instructions s35 “00 
Qualifying Orientation "54 —11 
Qualifying Cube-Counting "54 —:17 
Arithmetic Examination 257. —25 


Examination, General Paper p" 

English Precis —:09 
English Essay —'14 
Length of Education rating —11 
Education Standard tating 


measures with spatial tests and arithmetic. An 
factor, not shown here, linked the General Know 
the Current Affairs test, 

Sub-divisions of the Spatial Factor. 
made to sub-divide the spatial factor. 
kindergarten children, Kelle 


additional group 
ledge paper with 


Attempts have been 


The Thurstones in their 


igh school study (1941) noted a small factor in several tests 


Verbal and Non-verbal Factors in Intelligence Tests 75 


apparently involving visual pursuit, e.g. Mazes, but they have not 
followed this up. 

At a very different level, namely, highly selected aircrew, Guil- 
ford claims three spatial relations factors, a visualization factor, a 
length-estimation factor, and perceptual speed. Visualization 
occurred mainly in mechanical comprehension tests, though also 
in some k tests such as Paper Folding, and a test based on verbal 
descriptions of painted blocks of cubes. S2 is confined to Thur- 
stone’s Hands and Flags, tests. S1 was found in psychomotor tests 
of reaction time and complex co-ordination, in instrument and 
dial reading, and in certain spatial group tests; and S3 was another 
rather curious mixture. In his later publications Guilford (1948ab) 
appears to identify Visualization with the conventional k afd to 
define S1 as appreciation of spatial directions from the body (up 
down, to from, right left). But when Guilford and Zimmerman 
(1948) constructed tests for measuring these two factors separately, 
they correlated to about ‘5 among college students. Thus until 
much more confirmatory research is available, one would suggest 
that this plethora of spatial factors is more confusing than helpful. 

The relationship of k to mechanical abilities is discussed in 
Chapter X. An attempt to incorporate the main findings of this 
and the previous chapter in a diagram of mental structure is given 


at the end of Chapter VII. 


CHAPTER VII 


o 
PRACTICE, DIFFICULTY, SPEED, AND OTHER 
FACTORS 
° 

Abstract. Several unintentional factors, or extraneous condi- 
tions which influence factor content, are considered in this 
chapter. The form of the test item has not been shown to be of 

“great importance, but it is Possible that all objective (selective- 
response) tests embody a formal factor, which detracts from their 
educational or occupational value, 

Factor content is found to alter with practice, but no clear trend 
is discernible, unless the Practice is directed towards stimulating 
some ability. The more difficult, and the easier items of any one 
test may be answered by different methods by the more and less 
able testees, or else measure different abilities. Wrong responses 
to some tests may likewise show different factor content from 
right ones; and their predictive value should be followed up 
separately, j 

Speed of work can be partly distinguished under appropriate 


conditions from level, accuracy, or power, both with intelligence 
tests and (more readily) with si 


that there is little ground for the supposition that there exist a 
number of intellect 
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o 


citing Smith’s (1933) work, where fourteen tests were analysed 
among 186 students. Roughly one-third each were verbal, 
numerical and spatial; also about one-third were in completion 
(creative) form, one-third analogies and one-third classification. 
Using tetrad analysis, Smith concluded that any differentiation 
according to form is obscured by the stronger content factors, or 
else by unsuspected specific linkages. However, it is possible to 
apply Burt’s group-factor method to the correlations unaffected 
either by content or form, and to extract a g with some 35 per 
cent. variance. This leaves verbal, numerical and spatial content 
factors with about 15 per cent. variance, and Analogies and 
Classification formal factors with about 7 per cent., but no group 
factor for the creative tests. Thus this research suggests a ‘test 
ability’ factor or factors in the selective-response tests only. If 
this is confirmed, it would help to explain why the Stanford-Binet 
and Terman-Merrill scales appear to have better predictive value 
in daily life than do multiple-choice group tests. We know that 
larger practice effects occur on most group tests than on Terman- 
Merrill, and that these effects tend to spread from one test to 
another (cf. the summary in Vernon and Parry, 1949). Presumably 
testees become sophisticated to the instructions and the kinds of 
items used in all selective-response tests of intelligence, English, 
or other attainments; and as they differ in their degree of sophisti- 
cation, this creates an artificial group factor. 

Further evidence of formal factors was obtained by the writer 
in an analysis of forty-two items of the Progressive Matrices test, 
along with twenty-one other tests which served to define the 
factorial content of the items. The communality of the item 
factors approximated 40 per cent., and of this only some 24 per 
cent. could be regarded as content factors (g, v, k, perceptual, etc.). 
Both figures are unusually low because the testees wert a rather 
homogeneous group of 640 recruits. ® But the difference between 
them, 16 per cent., probably gives a fair estimate of the variance 
attributable to formal factors. Part of this amount represents a 
‘difficulty’ factor (cf. below), introduced by the imposition of a 
time-limit; for some testees spend more time on the early items, 
some more on the later ones. The rest appears to arise from a 
common form factor in all the items and from separate formal 
group factors in each of the sets of items of which the test is com- 


posed. 
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Effects of Training and Practice. Several investigations have 
_ shown that factor content may be modified by training (pp. 42, 116). 
Anastasi (1936) claims that group factors can be produced or 
obliterated by experience) on the basis of a study of five tests 
among 200 children. After taking two forms of the tests, the child- 
ren were given specialized training in new methods of answering 
three of them; e.g. they were shown how to employ spatial devices 
in certain reasoning tests. Some ten days later they took two more 
forms of the tests, and the inter-correlations and factor patterns 
showed marked alterations. : 

The effects of practice without directed training are somewhat 
obscure. McNemar (1936) found higher correlations between five 
‘psychomotor tests after three of them had been practised inten- 
sively. The first factor variance rose from 29-8 to 37-5 per cent. 
There was little alteration in a second, bipolar, factor. Woodrow 
(1938) had tests of arithmetic, anagrams, cancellation, length 
estimation, drawing ‘gates’, and others practised by fifty-six 
subjects for thirty-nine days, and then factorized initial, final, and 
gain scores, In another research (1939a), eighty-two subjects 
practised four tests sixty-six times, and initial and final scores were 
analysed along with several tests of known 


once) which helped to identify the factors. 
marked differences between the loadings of initial and final tests. 
Woodrow concluded that not only their common factor, but also 
their specific factor content had largely altered. He failed to find 
any general improvement factor, but the gain scores tended to 
correlate positively with N and P, that is with the factors under- 
lying the operations which were practised. They also showed 
negative correlations with his Attention factor. This suggests, 
quite plausibly, that the least ‘attentive’ subjects were able to gain 
most with practice. 
Heese (1942) gave an adding test and five psychomotor tests ten 
times to fifty students, and found positive but low correlations 
between the gain scores. Like Woodrow he denies a general im- 
provement factor, but the three rotated factors that he prefers 
seem to have little meaning. In another study by Greene (1943), 
four parallel forms of twelve miscellaneous tests (aiming, tapping, 
mazes, etc.) were given to 394 14-15 year boys. The first and last 
forms were analysed separately. The last forms showed more 
overlapping and stronger group factors (communalities of 49-7 


Both studies showed 


factor content (given | 
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and 59-4 per cent., respectively), and it is claimed that the factor 
patterns altered. But in the absence either of the original corre- 
lations or of the unrotated factors, it is difficult to say what, if any- 
thing, this investigation proves. Melton (1947): presents a factor 
analysis of seven psychomotor tests among 350 aircrew candidates, 
where several trials were given and were treated as separate varia- 
bles. He claims that the factor content of the tests altered consid- 
erably even over five to ten minute periods, some factors showing 
higher and some lower loadings as the trials progressed. Butin view 
of the low reliability of the scores for separate trials it would 
appear, to the writer at least, that there is no more alteration than 
might be expected by pure chance. 

Alterations of Factors with Difficulty. Several writers have 
drawn attention’ to ‘difficulty factors’, but their significance in 
practical mental testing has not yet been ‘worked out clearly. 
Hertzman (1936) studied the correlations between the scores on 
the easier and the more difficult halves’ of seven tests and, though 
he did not arrive at any factors, was able to show that the two 
halves often measured somewhat different abilities.» Ferguson 
(1941) points out that if items in a test (or sub-tests in a battery) 
are homogeneous in content yet different in difficulty, a spurious 
factor ot factors will be introduced, over and above the general one, 
contrasting the difficult and easy items (or tests).. This assumes 
that inter-correlations are calculated by product-moment or point 
coefficients. Wherry and Gaylord (1944) show that this can be 
overcome by substituting tetrachoric correlations, and recommend 
that this technique of correlation should be generally adopted for 
factorial investigations, unless all tests yield normal score distribu- 
tions. However, this is not the whole story. Thus Guilford (1941) 
analysed correlations (using tetrachorics) between the ten sets of 
items in Seashore’s Pitch Discrimination test, and found that the 
most difficult sets (5 to 0.5 cycle discriminations) measured quite 
different factors from the easier sets (30 to 8 cycles). One would 
have thought these sets to be highly homogeneous in content, but 
it is clear—either that the difficult ones measure a different pitch 
ability from the easy, oF that the more able subjects use different 
methods of discriminating from the less able. ! 

An investigation by the writer of intelligence and educational 
tests at 75 per cent. and 25 per cent. difficulty levels (cf. p. 30) 
yielded much the same pattern of group factors among the more 
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and less able, but showed larger group factor content, relative to 
&, among the latter. Shaefer (1940) makes the interesting sugges- 
tion that the perceptual P factor emerges only at a fairly low 
difficulty level; more difficult items of the same kind (or the same 
items answered by less able testees) presumably bring in reasoning 
o1 other ‘higher’ factors. He does not say whether he has ex- 
perimental evidence, but he may be inferring from the different 
results of presumed P tests when given to adult students and to 
high school pupils (p. 70). z b í 

While we shall probably continue to base our studies of test 
content chiéfly on product-moment correlations between (approxi- 
mately) normally distributed Scores, we should certainly not neg- 
lect the indications of these researches that content may differ 
considerably at different levels, 

Factors in Right and Wrong Answers. Another fruitful 
Suggestion of Guilford’s is that right scores and wrong scores on 
the same tests should be analysed separately, unless they correlate 
with each other more highly than, say, -80 (when corrected for 
attenuation). In several U.S.A.A.F. studies, scores based on the 
numbers of wrongs had different factor loadings from scores for 
rights, and different validities for aircrew selection. It follows that, 
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their actual ability. Cattell (1943) concludes, from his survey of 
the literature, that inability to do tests quickly is one of the main 
differentia between older and younger adults, and recommends 
that tests for adults of mixed age should usually be unspeeded. 
The problem is a complex one since the influence of speed varies 
not only with the age of the subjects and type of test material, but 
also with the instructions given or the methods of recording speed 
and power, accuracy or level. As Davidson and Carroll (1945) 
point out, most time-limit tests emphasize both aspects, and 
actually measure a mixture of the two components in varying 
proportions. > 
McFarland (1928) gives a useful critical review of the early 
literature. He points out-that many investigators have failëd to 
measure speed and power independently, but concludes that they 
are fairly closely correlated in the performance of mental tests. 
Himmelweit (1946) also reviews the many rather contradictory 
researches on speed and accuracy, and shows that they accord 
reasonably well with the generalization that speed and accuracy 
are highly correlated among tests of complex mental functions, 
but may be negatively correlated among motor or manipulative 
tests. Intermediate relationships are found for simpler mental 
tests or for mental + manipulative (e.g. mechanical) tests. The 
separation is increased when, as in many motor performances, 
subjects are aware of any errors they make. One would suggest in 
addition that correlations rise when skills are highly practised. 
The learner is apt to be either quick or accurate, but the experi- 
enced worker is usually quick and accurate, or slow and inaccurate. 
Thus handwriting speed and quality always correlate positively in 
the primary school, though not highly (cf. Burt, 1917, 1939; Gates, 


1924). 
In Himmelweit’s own experiments, common factors of average 


variance 35 per cent. were found among quickness scores, also 
among accuracy scores, on tests of addition, cancellation, under- 
lining words hidden in pied material, and the practical Track 
Tracer test, even when intelligence as measured by Progressive 
Matrices was held constant. In the absence of other reference 
tests it is difficult to specify the nature of these factors. The 
y concerned with showing that dysthymic 


investigation was mainl 
tc.) are slower and more accurate 


mental patients (anxiety cases, € 
than hysteric patients. 
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Proofs of the Speed Factor in Mental Tests. Davidson and 
Carroll’s (1945) investigation, at the college student level, gave 
convincing results. The Army Alpha and other verbal and 
numerical tests were scored for the time taken to try all items once, 
and for items correct without time limit, as well as by the ordinary 
time-limit method. Nineteen measures, apart from time-limit 
ones, were analysed and six factors were claimed. But several of 
these were inter-correlated, and it appears more justifiable 
merely to examine the second and third. (bipolar) factors, before 
rotation. These clearly differentiate speed scores from level 
scores, and’ verbal tests from numerical tests. Their variance is 
small compared with that of the first—general—factor, but both 
“approximate to 10 per cent. It is unfortunate that no non-verbal 
tests were given under the same conditions, but we have no reason 
to suppose that speed-level scores on these would not have been 
similarly differentiated. 


Similarly Slater (1938) gave untimed CAV (Thorndike) tests 
to 14-year children and arrived at S i 


difficulty levels, 
all around zero. 


Yet another technique of measurement has been developed by 
Furneaux (1948), Cyclic tests are used, consisting of batches of 
items of similar difficulty, each batch bein, 
preceding one. These are pi 


wever, these scores are more akin 


fulness’ than to Slater’s and Tate’s 
power and preferred-rate-of-work scores, 
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Discrepant Investigations. The high correlations between 
tests given with and without time limits do not disprove the 
existence of a speed factor, for the time-limit score is usually very 
largely a power one. Bernstein’s (1924) early study is often cited, 
where numerous tests were done by 11-13-year boys under condi- 
tions of leisure or of haste. No distinctive group factors emerged, 
and both sets correlated equally well with teachers’ assessments 
of intelligence. Moreover, the differences between scores on the 
two sets failed to correlate with (rather unreliable) estimates of the 
boys’ slowness at work. But these negative results are probably 
attributable to the fact that even the ‘leisure’ tests were done with a 
time limit. In other words the conditions of the experiment, and 
methods of measurement were inadequate to bring out the differ- 
ence that Davidson and Carroll found. Similarly in Sutherland’s 
(1934) experiments with group verbal, performance, and other tests, 
the conditions were imperfectly controlled. He partialled out (held 
constant) power scores obtained without time limit, and did find 
some residual correlations among time-limit scores, but was unable 
to prove that these yielded a statistically significant speed factor. He 
did, however, show that sucha factor was likely to be more promin- 
ent in simpler cognitive tests than in tests of higher mental functions. 

Speed in Tests Other than Intelligence Tests. It will be re- 
called that rate has been established as a partially distinct factor in 
at least at the adultlevel (p. 45). Kelley’s (1928) analyses 
d 9-year groups similarly postulated a 
separate speed factor in reading and arithmetic speed tests, over 
and above their general, verbal and numerical content. Whether 
this is the same as speed at intelligence tests has not been studied. 
‘As Sutherland and Himmelweit indicate, a distinct speed factor is 
most readily demonstrable in simple cognitive or in motor tests. 
Hargreaves (1927), Holzinger ( 1934-5) and Woodrow (1938) have 
used such tests as speed of writing figures or words, copying prose, 
coding (substitution), counting clusters of dots, and simple addi- 
tion sums, which show considerable overlap beyond g. Hargreaves 
found that this factor entered largely into so-called fluency tests. 
While there is insufficient evidence to justify identifying fluency, 
in the sense of wealth of associations, with speed, it seems quite 
possible that Thurstone’s W might turn out to be mainly the speed 
aspect of V. Presumably P and Q, the perceptual speed factors, are 
also closely related to mental speed. 


reading, 
of tests given to 13-year an 
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There is no clear distinction between speed factors and the ease 
vs. difficulty factors discussed above. Thus an investigation by 
DuBois (1932) showed much higher correlations between easy 
Arithmetic, Analogies, Directions and Vocabulary tests than 
between these and power tests. This result might be interpreted 
as evidence for an ease rather than a speed group factor. Yet 
another link is suggested by M. D. Eysenck’s (1945) work with 
senile patients, where a group factor was found among tests of oral 
and writing speed and in digit span or rote memory tests. This has 
not been duplicated in more normal groups of subjects. 

One might expect cognitive and manual dexterity tests to yield 
separate factors, apart from the slight dependence of both types of 
test ong. But Holzinger showed that there was a common element 
in his mental speed tests and in tapping, dotting, writing, and 
traving a simple maze, as well as specialized factors within each 


type. In other words there was a general speed factor which could 


be sub-divided. It is 


i pational Analysis also reports 
ctor with loadings for almost all the time 


factors, W, P, M, or to ease vs, difficulty? 
In the writer’s view, the soluti 
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sake of administrative convenience. Provided that the limits are 
generous these should yield reasonable approximations to power 
scores for the main factors—g, v, n, k, etc. But the important thing 
is to follow up power, speed and mixed (time-limit) scores, and 
find which yield the best validities. Thus in 11+ selection, the 
grammar schools presumably require a modicum of speed-at-work 
in addition to power. We ought therefore to find out the optimum 
weighting, and it might then well turn out that our present tests, 
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Fig. 4. Diagram of Intellectual and Practical Factors in 
Psychological Tests 


with slightly increased limits, would“reproduce this weighting as 
accurately as a cumbersome system of separate speed and power 
measures. Similarly in the vocational field, where the speed and 
accuracy of simple cognitive, or of manipulative, operations are 
more readily measurable, follow-up will show the appropriate 
combination for any job. And when a fairly wide experience has 
been gained in this way, it will be easier to decide the best means 
of picturing speed, accuracy, and other components of human 
abilities. 
G 
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Conclusions Regarding the Structure of the Main Factors 
in Psychological Tests. It would be more confusing than helpful 
to try to portray all the factors mentioned in this book in a single 
diagram. Hence educational and occupational abilities are shown 
separately in Figs. 3 and 7. The sensory, perceptual, aesthetic and 
other factors described in Chapter VIII are also isolated, in F) ig. 5. 
Here however in Fig. 4 we can bring together the findings of this 
and the two preceding chapters, and forestall those of Chapters IX 
and X. Personality and physical factors are omitted, though their 
influence (e.g. on fluency, or on athletic abilities) should not be 
forgotten. “It is not possible to show factors attributable to the 
form of the test, wrong answer factors, or alterations brought 
about by practice. But the speed factor is indicated by an enclosed 
area on the left of the diagram. Some of the links that seem to 


occur only in selected groups such as high-grade or low-grade 
subjects, are shown by dotted lines. 


ee 


. 


CHAPTER VIII 


SENSATION, PERCEPTION, IMAGERY, AND 
AESTHETIC ABILITIES 
o e 

Abstract. There is little or no empirical evidence for the 
numerous types of perception, attention, imagery, reaction time, 
etc., which are so popular in German psychology. A small sensory 
discrimination factor may be recognized, whose visual branch 
includes distinctive colour vision sensitivities. The auditory 
branch (which enters only to a small extent into musical and speech 
abilities) may also be sub-divided. Numerous visual perception 
factors have been distinguished by Thurstone and others, though 
their significance in educational, vocational, or abnormal psycho- 
logy is not known. Imagery types (visual, auditory, motor) can be 
distinguished by appropriate techniques. Musical discrimination 
is well substantiated, and it may be linked with aesthetic factors in 
the visual arts and in literature. 

Types of Perception, Reaction, etc. When the early experi- 
mental psychologists, particularly those in Germany, observed 
individual differences in people’s responses to sensory and other 
stimuli, they were apt to classify them into ‘types’. ‘Thus muscular 
and sensory types were distinguished in simple reaction time, 

. synthetic and analytic types of perception of tachistoscopic 
material, broad but shallow vs. narrow and concentrated types of 
attention, and so forth. Little or no attempt was made to deter- 
mine their consistency, e.g. whether people who were® synthetic 
with one set of material were synthétic with,other sets. As early 
as 1904, Spearman questioned this point. Reaction time types 
soon multiplied and were seen to represent little more than quick- 
ness vs. slowness. McQueen (1917) found that people who 
were able to ‘distribute their attention’ between one pair of tests 
were not superior in distributing it between other pairs. But 
German psychologists ignored such results, together with the 
more objective approach offered by Spearman ’s techniques. Even 
in the 1920s and °30s their work was characterized by a plethora 
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of unsubstantiated and unco-ordinated typologies (cf. Vernon’s 
summaries, 1933ab). Very few distinctive factors have emerged 
from British and American investigations in this field, though 
admittedly such investigations have been few in number, and 
much remains to be discovered. 

Sensory Factors. In his early study of forty-three schoolboys, 
Burt (1909) noted higher correlations between four tests of sensory 
discrimination (two-point threshold, lifted weights, pitch, length 
‘of lines) than could be attributed to g., He suggested that such 
discrimination constitutes a wide, though shallow, factor, since 
each measure of acuity has a large specific component; also that 
visual and auditory perception might yield distinct sub-factors. 
‘Little further work has been done, possibly because most of the 
tests have to be applied individually and are too time-consuming 
with large numbers. However, Carey (1915-16) confirmed the 
existence of an auditory group factor, but found no visual, tactile, 
or general sensory factor beyond g. Burt (1927) points out that his 
factor would not provide any justification for the popular notion 


of perceptive vs. reflective types. Such types, if they exist at all, 
are likely to be more a matter of te 


mperament and interest than of 
abilities. 

Differentiation between the senses 
means of memory tests, based on t 
presented visual, auditory or other 
memory factor, beyond g, 
very slight grouping acco 
found distinctive abilities 
gustatory + olfactory. Ta 

Factor analysis has been ap’ 


. Carey also studied types of imagery, but 
found the current objective tests to be quite useless. He was able 
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to derive fairly reliable measures of visual and auditory imagery 
from introspections, even among schoolchildren, but was unable 
to prove any differentiation between these modalities, or any over- 
lap either with visual and auditory discrimination or memory 
(apart from g). A more promising method was developed by Burt 
in 1912. He got several persons to assess the vividness of their own 
images of a hundred different experiences. There was fair agree- 
ment between all the ratings, showing that certain experiences can 
be imaged more readily than others by everybody. But some sub‘ 
jects tended to give higher ratings to their own visual images and 
low to their auditory and motor images, whereas othérs assessed 
their auditory images or their motor images relatively highly. By 
inter-correlating the ratings of twelve subjects, Burt (1938)°was 
thus able to show that the group factors corresponding to these 
imagery types carried 18-2 per cent. of variance, as compared with 
46-7 per cent. of variance attributable to the general order of 
vividness. Burt (1940a) has shown also that this technique of ‘cor- 
relation between persons’ reveals the same group factors as the 
more usual correlations between tests. Further investigation along 
these lines would be profitable. It is possible that visual and verbal 
types might be found more fundamental, and that important links 
might be established with aesthetic, practical and intellectual 
abilities. Clearly visual imagery has something to do with k. 
Another research suggesting an imagery factor is that of Ormiston 
(1939), but her tests are not fully described and no correlations are 


quoted. e . 
Perceptual Factors. By far the most extensive investigation is 


. that of Thurstone (1944), where forty-three sensory and perceptual 


tests (almost all visual) were given to 170 students, and ten factors 
identified. Chief of these was Facility and Firmness in Perceptual 
Closure, or the ability to see a relatively unorganized stifnulus as a 
good configuration. Several k tests snowed high loadings on this 
factor. Flexibility in Manipulating Conflicting Configurations was 
another factor present in two-hand co-ordination tests and in cer- 
tain problem-solving and reasoning tests. It sounds as though 
this was largely g. Susceptibility to Optical Illusions was distinc- 
tive, also Reaction Time to light or sound, and an Oscillation factor 
in rate of fluctuation of reversible perspective figures. Two differ- 
ent Speed factors appeared, but they do not seem to be related to 
the primary ability, P. Form and colour dominance tests failed to 
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yield a consistent factor, and measures derived from the Rorschach 
Inkblot test gave a factor among themselves (perhaps in the nature 
of richness of association), but showed little overlapping with any 
other perceptual factors. This is in marked contrast to the claims 
of Oeser (1932) and of numerous German psychologists that pre- 
ference for form or for colour constitutes an important type which 
connects up with Rorschach types, with Jaensch’s eidetic types, 
and with physique and temperament (cf. Vernon 1933b). Thur- 
Stone suggested that administrators, leaders, good and poor readers 
and other groups obtain distinctive profiles or patterns of scores 
on his perceptual factors. No further confirmation of this has been 
published, and in view of the negative results of much other re- 
search, we are entitled to doubt whether they have any significant 
bearing on educational or vocational abilities, 
They are more likely, perhaps, to be differentially affected in 
different types of mental illness. In a later article, Thurstone 
(1948) reports a closure factor in auditory tests, and proposes to 
investigate whether this is the same as his first visual factor. 

M. D. Vernon (1947) studied the perception of a variety of 
materials presented tachistoscopically, and showed by their inter- 
correlations that they fell into two main types. The first involved 
rapid discrimination of fine details of shapes, while the second 
depended on assimilation of shapes and comprehension of their 
meaning. She suggests a correspondence between these and 
Thurstone’s Closure and Flexibility factors respectively. The first 
also sounds very similar to non-verbal P factor. Actually however 
the second, not the first, gave moderate correlations with a spatial 
test and with AH4 Pt. 2—a test of §, k and (probably) P. She 
points out that there is great specificity in perception, since the 
performance of testees depends so largely on the particular condi- 
tions of the experiment, and on th 


i E e sets or ‘schemata’ by means of 
which they interpret what they see. 


One large-scale study was made durin 
perceptual and motor tests designed for 
for anti-aircraft work. This included visual acuity, aircraft spot- 
ting, two tests of perceptual acuity, pursuit meters, dotting, etc. 
In a group of about 400 women the overlapping was so small and 
irregular that a first factorization justifiably attributed it all to g. 
Re-analysis did however show additional factors with variance 
11-9 per cent., the g-variance being 4-0 per cent. These appeared 


or on personality. 


g the war of a battery of 
selecting A.T.S. recruits 


-3 


, 1934). Thus among seventeen 


Sensation, Perception, Imagery, and Aesthetic Abilities 91 


to be a general sensory-motor factor (perhaps the same as Guil- 
ford’s Psychomotor Co-ordination), and sub-factors for the 
sensory-perceptual and the co-ordination tests. The statistical 
significance of these factors was doubtful, and none of them except 
g showed any relationship to proficiency. In the U.S.A.A.F., 
Guilford and Lacey found avdistinctive factor in tests involving 
estimations of lengths and distances. The Thurstones (1941) also 
note a narrow factor at 14 years in several teŝts involving counting 
the numbers of dots in patterns. , $ 

Auditory Factors. Rather more work has been done in the 
auditory and musical fields. The Seashore and other tests such as 
Drake’s and the Kwalwasser-Dykema series always gives a group 
factor beyond g, but tend to be so poor in reliability that no‘con= 
sistent sub-grouping is found. For example, in Drake’s (1939) 
analysis of four Seashore and four other tests among 163 boys aged 
around 13, there was a general factor with over 30 per cent. 
variance, and strong residual overlap between Pitch and Intensity, 
Pitch and Tonal Movement (Kwalwasser), Tonal Movement and 
Tonal Memory (Seashore). Manzer and Marowitz’s (1935) 
correlations between the ten Kwalwasser-Dykema tests among 
452 students suggest a musical training factor (in Pitch and 
Rhythm Imagery, Tonal Memory and Tonal Movement), and a 
sensory factor (in Time, Quality, Rhythm, Pitch and Intensity 
Discrimination, and in Tonal Memory). Several other researches 
indicate that elementary auditory capacities such as those measured 
by the Seashore tests have very little relation either to musical 
ability or to perception of speech (cf. Howells and Schoolland, 
tests given by the writer to some 
a Musical Knowledge test and total score on 
the Oregon Music tests had general musical factor loadings of 
-84, whereas the Seashore Pitch and Rhythm tests had loadings of 
-28 and -35. The Seashore Tonal Memory test, however, had a 
general factor loading of 65, and the three Seashore tests had a 
considerable group factor of their own. 

In an extensive research, Karlin (1942) gave thirty-two tests, 
mostly auditory, to 200 high school pupils, and identified eight 
factors after rotation. Though he claims that auditory abilities are 
complex and yield no general factor, in fact almost all his corre- 
lations were positive. The first (unrotated) factor carried some 
15 per cent., and the combined bipolars 26 per cent. of variance. 


seventy students, 
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We would agree that group factors are more prominent than 


general auditory ability in his very miscellaneous battery. The 
main factors appeared to be: 


Pitch discrimination for complex and pure tones, and quality 
discrimination. 

Loudness discrimination. 

Time discrimination (overlapping with loudness, and other 
tests) 

‘Perception of masked and distorted spéech. < 

The remainder, involving auditory span, various memory tests, 
etc., were less clear-cut. 

` Musical Ability. Wing 

of music tests 


6 


intensity. This (or rather th 
correlates only to about -3 w. 
to the author, but little aff 
experiments where reliable assessments of musical ability were 
obtainable, its validity averaged -80, Bipolar factors, with 
variances 13 -4 and 3-1 per cent., indicating the presence of group 
factors, separate the tests of perception of alterations in melodies 
t of suitable harmony, rhythm, 
ion is indicated between ability 


hm and Melody scores, On the other 
. d, j 
firming Wing’s third factor. yes 


= 
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Like Wing, Vidor considers musicality to be independent of 
training, at least in children up to about 15. A closer integration 
seems likely to develop later; for in the writer’s research, measures 
of training and musical knowledge correlate very highly with 
Oregon musical judgment scores, and with any other tests based 
on musical (as distinct from auditory) material. The writer also 
has some evidence to support the popular view of a connection 
between mathematical and musical abilities; though he knows of 
no published proof. asf 

Aesthetic Discrimination. The correlation-between-persons 
technique has proved useful in the visual and literary arts. Burt 
(1933) collected fifty reproductions of miscellaneous paintings, and 
on getting these arranged in order of appreciation by artist$ and 
artistically naive adults and children, found a strong tendency to 
uniformity. Hence a person’s approximation to the standard order 
can be used as a measure of his artistic discrimination. Eysenck 
(1940) showed that the same factor extends from paintings of 
landscapes, portraits, etc., to other visual material such as pictures 
of clocks, embroidery and vases, and abstract curves and polygons, 
and even to odours. As with imagery, subsidiary type factors can 
be established, some individuals for example tending to rate for- 
mal, classical art more highly than colourful, impressionistic or 
representational art, some the reverse. These, however, might be 
regarded as attitudes or tastes, which fall outside the scope of this 
book, whereas the general factor is more akin to an aesthetic 
ability. Guilford and Holley (1949) suggest that an individual’s 
communality may be regarded as measuring his ‘objectivity’ of 


, judgment, whereas his specificity shows the subjective element in 


his tastes. š 
Dewar (1938) found positive though low correlations among 


children between a modification of Burt’s test, and*other art 
judgment tests such as McAdory’s, Meier-Seashore’s and Bulley’s, 
and teachers’ judgments of the children’s artistic ability. This 
confirms the existence of a general factor, but suggests that in 
children discrimination is rather unreliable, being much affected 
by specific factors derived from the method of testing, the material 
used (e.g. paintings, furniture, etc.), and types of taste. Like 
musical ability, it correlates with intelligence, but not highly. 
Similar tests consisting of literary passages were developed by 
Williams, Winter and Woods (1938), and applied to groups of 


94 The Structure of Human Abilities 


girls aged 11-17, totalling 256. Here a general literary discrimina- 
tion factor showed a rather high correlation with a verbal in- 
telligence test, i.e. with g +v. In fact its variance was reduced 
from some 53 per cent. to 16 per cent. by holding intelligence 
constant. The literary factor also showed some overlap with tests 
of artistic and musical discrimination, but it is not indicated 
whether this might be accounted for by g, or whether there is a 
general aesthetic capacity for all types of art, plus more specialized 
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Fig. 5. Diagram of Sensory, Perceptual, Imagery and Aesthetic 
Discrimination Factors 


factors in"the different arts. The former might well be true of 


children, though Eysenck’s’ results 
relatively sophisticated adults, eee 


Conclusions. It is Particularly difficult to express the findings 


of the present chapter diagrammatically, because of the lack of any 
comprehensive investigations which might show the relations 
between sensory, perceptual and aesthetic, and well-established 
factors. Moreover, exploration of this field has been extremely 
Scrappy, and an indefinite number of other such factors will 
probably be revealed by further investigation. There is little or no 
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channelling of perceptual-imaginal abilities into a few major 
branches by cultural influences as there is in the intellectual field. 
No attempt has been made, therefore, in Fig. 5 to indicate the size 
of g-loadings by distances from the centre, nor the amount of over- 
lapping by contiguity. Nevertheless, the diagram serves to put 
almost all the factors mentioned in this chapter into some sort of 
order. Note that no general perceptual or imagery ability (akin to 
general educational or practical ability) is claimed; also that, al- 
though, all auditory and musical functions are probably linked, 
there is no proven connection between visuo-perceptual, visual 
imagery, and visual art factors. 


CHAPTER IX 
PSYCHOMOTOR AND PHYSICAL ABILITIES 


° Abstract. There is sufficient overlapping among manual and 
sensory-motor tests, particularly in unselected groups, to justify 
the conception of a psychomotor factor over and above g and k:m, 
together with group factors for special types of performance. But 
their variance is so small that their limits and essential content 
are not known, and they may be much affected by age, sex, prac- 
tice, etc. Overlapping is greater among the more complex tests, 
also in low-grade subjects, but this may be due to their greater de- 
pendence on mental factors. 

The existence of a general physical-athletic factor is well sub- 
stantiated, together with clear-cut group factors in different types of 
athletic performance. G and k:m enter to a small extent, and there 
seems to be a linkage between gross and fine muscle co-ordinations. 

Specificity of Psychomotor Abilities. 
motor and manual tests (usefully summari 
Pear, 1932) revealed very 
between them and intelli 
they usually have to be 
possible to test large gro 
reliable and lacking in s 


Early work on sensory- 
zed by Weiss Long and 
low correlations among such tests, and 
gence or vocational proficiency. Since 
given individually, it has seldom been 
ups, and all correlations tend to be un- 
tatistical significance. At the same time 
there is not complete specificity; most of the correlations are posi- 
tive, though some may be negligible and even negative, especially 
in high-grade or restricted groups. 

In general the overlapping is greater among more complex tests. 
For example, Farmer’s (1927, 1929, 1936) battery of ‘aestheto- 


kinetic’ tests, namely, Choice Reaction, Dotting Machine and 
Pursuit Meter, has consistently yielded a group factor, beyond g, 
the average inter-correlation being about -25. This battery gives 
small correlations with several trade skills, and with freedom from 
accidents. Farmer’s work Suggests too that accident proneness is a 
fairly broad factor. For example, those who undergo many 
accidents in their jobs tend to do the same at home. 


ee 
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R. H. Seashore (1930, 1940) has developed and standardized a 
reliable battery of six tests, known as the Stanford Motor Skills 
Unit, namely: 


Spool Packing, speed of bimanual co-ordination. 

Koerth Pursuit Rotor, accuracy in following a moving target at 
high speed. 

Motor Rhythm, precision in repeating amauditory rhythm on a 
tapping key. 2 

Serial Discrimination; quickness in making discriminatory finge 
responses to number signals. 4 

Tapping speed, on a telegraph key. 

Speed Rotor, speed of rotatory arm wrist and finger movement 
in turning a hand drill. 


Though he denies the existence of a general motor ability, Sea- 
shore’s own figures and those of Walker and Adams (1934) yield a 
mean correlation of over «3, even among students. He states that 
the degree of specificity is as great among children down to six 
years as among adults. McNemar (1936) gave five tests (two from 
Seashore’s battery) to 182 junior high school boys, and found a 
stronger general factor when some of them had been practised. 
The mean correlations were -266 before and -392 after practice. 
In low-grade groups such as the defective boys tested by Atten- 
borough and Farber (1934) and the senile adults tested by M. D. 
Eysenck (1945), the mean correlation between quite simple tests 
(pegboard, tapping, nuts and bolts, etc., among the former, and 
ergograph, steadiness and aiming among the latter) average close 
_ to +4; that is, they show some 40 per cent. of common variance. 
Group Factors. Tests of closely similar functions tend to yield 
higher inter-correlations, thus suggesting that it would be more 
legitimate to talk of dexterities rather than of dexterity or of a 
general psychomotor ability. However, there is little agreement 
so far as to which group factors are the most consistent and 
distinctive. 3 
Earle and Gaw (1930) compared tests depending mainly on 
depending more on precision of finger, hand and 
arm movements, and found an average inter-correlation of +36 
among the former, :29 among the latter, but only -13 between the 
two types of test. They confirm Burt and Moore’s (1912) finding 
that boys tend to be superior on straightforward speed and 


speed and others 
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strength tests, girls on tests that appear to involve ‘neat-fingered- 
ness’. Thus this grouping may be related to sex differences. 
Buxton (1938) gave nine steadiness, tapping and packing tests 
to seventy-six boys and obtained very low average inter-corre- 
lations. But factorial analysis suggested the presence of narrow 
group factors in closely related skills. More extensive was Sea- 
shore, Buxton and McCollom’s (1940) investigation of nineteen 
tests among fifty students, which yielded the following groups: 


‘Simple reaction times. t 

Tapping, oscillatory movements in one plane. 

Stylus tapping, between two or more plates. 
~ Postural sway. 

Serial discriminatory reactions. 

Pursuit co-ordination. 


Motor rhythm. 


The grouping appeared to be functional rather than anatomical, 
for alterations in the particular muscles or senses employed reduced 
the correlations less than did changes in the patterns of behaviour. 
Dudek and Seashore (1948) quote similar results. The investiga- 
tion by Allport and Vernon (1933) of expressive movements like- 
wise indicated the existence of fairly general patterns of action in 
numerous simple tasks (drawing, walking, etc.). For example, 
there was an areal or expansive tendency, a contrast between 
centripetal and centrifugal movements, and a factor of strength 
or emphasis. These types of movement appeared to be related to 
underlying personality traits. In addition the natural or normal 
speed adopted in forty-five varied tasks was recorded, and though 
there was a slight tendency to positive inter-correlation throughout 
there Were more marked group factors of verbal speed (reading, 
counting, also handwriting), drawing speed (on paper, on black- 
board, with foot), and rhythmic or motor speed (tapping or con- 
traction of various muscles). The Division of Occupational 
Analysis finds two factors among apparatus tests which are identi- 
fied as F (finger dexterity) and M (manual dexterity), The Min- 
nesota Placing and Turning tests and a new pegboard test are good 
Pere of M, while F emerges in fine assembly work. In addition 

ere is a hand-eye co-ordination or aiming factor in drawing and 
writing tests (cf. p. 134), and a speed factor in manual and other 
tests (cf. p. 84). The correlations on which these factors are based 
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have not been published, hence the variances cannot be assessed. 
Guilford and Lacey (1947) and Melton (1947) find a broad 
Psychomotor Co-ordination factor in a complex serial reaction 
time test, pursuit, aiming, and finger dexterity tests, and various 
smaller factors among pairs of tests. Other factors described by 
these authors (S1, $3, cf. p. 75) combined abilities at certain 
apparatus tests and at paper-and-pencil tests. . 

Dependence of Psychomotor Abilities on Higher Factors. 
According to the hieragchical viewpoint, psychomotor factors 
branch’ off from the major k:m or practical group factor, and 
several researches do show loadings of sensory-moto? tests with 
higher factors, at least in heterogeneous groups. 

Van der Lugt (1948) makes the interesting suggestion that’ thie 
correlations of manual tests with intelligence are non-linear, being 
moderately high among very dull children, and zero or even 
negative among very bright ones. This is borne out by Atten- 
borough and Farber (1934), who obtained coefficients averaging 
-52 between dexterity and tapping tests and Stanford-Binet and 
Otis Primary tests among eighty boys whose I.Q.s ranged from 
45 to 105 (median 70). Again M. D. Eysenck (1945) found a 
mean correlation of -26 between the Matrix test and Steadiness, 
Tapping, Dynamometer and Ergograph tests among seventy-five 
senile patients. 

Investigations in the Services. Few psychomotor tests were 
used on a large scale in the British Services. However, Table IX 
gives a group-factor analysis of thirteen tests applied to 500 naval 
recruits—a moderately, but not highly, selected group. Test 102 
involves judging the tension of stretched strings. Test 104 con- 
sists in picking up ball-bearings with tweezers, spoon and fingers. 
The others have been described elsewhere (Vernon, 1947b). The 
Table shows the two major factors and their sub-divisions into 
verbal numerical, informational, spatial, and manual dexterity 
minor factors.! Here the g and k-m variance of the two psycho- 
motor tests amounts only to 3-1 per cent., their minor factor 


content'to 7-3 per cent. 
Three apparatus tests were regu 
SMA3, a hand-eye co-ordination test, 


larly applied in the R.A.F.— 
and a finger dexterity test 


1 Tt should be pointed out that the extraction of as many as eight factors from 
500 cases is open to criticism, and that minor factor loadings derived from only 
two tests each are indeterminate. 
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(cf. Vernon and Parry, 1949). An analysis of these and nineteen 
other tests among 785 rather highly selected aircrew candidates 
gave an average g-variance of 4-7 per cent., k:m or perceptual 
factor variance of 6-0 per cent., and an additional psychomotor 
factor of 16-2 per cent. 

Other Studies. Teegarden (1942) quotes correlations for large 
and heterogeneous groups of young adults between the Kent- 
Shakow Formboard (simple speed problems and complex prob- 
lems scored Separately), Minnesota Spatial Relations Formboard, 


TABLE Ix." GROUP-FACTOR ANALYSIS OF MECHANICAL AND 
OTHER TESTS AMONG 500 ORDINARY SEAMEN j 


Inf. Spat. Dext, 


a) Progressive 
Matrices 
1 Abstraction 
1 Dictation 
3a Arithmetic 
3b Mathematics 


2 Bennett Mechanical| - 
100 Mechanical Infor- 
mation 
101 Electrical Informa- 
tion 


4 Squares Spatial 
emory forDesigns| + 
103 Wirebending s 
102 Tension 
104 Ball-lifting 


Variance per cent. 


Minnesota tests of 
and Cincinnati Pliers Dexterity 


only -16 with their adaptati 
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speed _tests correlated *22 with Stenquist and gave similar 
coefficients, ranging up to -36, with the (paper-and-pencil) Form 
Relations test and Group Test Thirty-four, and with several 
standard performance tests. A somewhat discrepant result was 
obtained by Shuttleworth (1942) among 109 13-year technical 
school boys, who took sixteen tests used in the Birmingham voca- 
tional experiments. The Ball-lifting test (SP Test 104) gave a 
correlation of -24 with his general factor but on the bipolar factor 
separating k.m from v:ad tests, neither this dexterity test not 
Memory for Designs and Squares spatial tests, showed any 
mechanical loading. The selectivity of the group may perhaps be 
responsible. 

Cox’s Investigations. Perhaps the most illuminating wofk fh 
this field is that of Cox (1928, 1934), though based chiefly on small 
groups of schoolboys and on the inconvenient (yet highly delicate) 
tetrad difference technique of factorization. In his earlier experi- 
ments he intentionally eliminated any element of manual dexterity. 
by using paper-and-pencil tests with pictures of mechanical 
models, He was able to establish overlapping in such tests, beyond 
g, which he attributed to m factor, that is the capacity for com- 
prehending and employing mechanical relationships and prin- 
ciples. Later he studied tests of assembling and stripping, and 
showed the same factor to be present in these, particularly in the 
less routine assembly processes. When routine assembly and 
stripping and other dexterity tests such as pinboard and eyeboard 
were compared, there was consistent evidence of small residual 
overlapping, with g and m held constant, which he attributed to a 
‘routine manual factor’. Additional minor group factors were 
indicated among sets of closely similarly dexterity tests. Each test 
of course, also involved a specific factor, and this specific variance 
was much greater in the dexterity tests than was their g, m, manual 
or other factor content. Z 

Minnesota Investigation. Some of the inter-correlations ob- 
tained in the Minnesota study of mechanical ability are shown in 
Table X. Though they were not subjected to group factor or 
multiple factor analysis by Paterson and Elliot, they indicate the 
presence of a prominent general factor, presumably a mixture of 
gandk:m. Clearly there is also a v:ed group factor in the Academic 
Grades and Otis I.Q., possibly entering into the Paper Formboard 
and Information tests. Packing Blocks and Card Sorting show a 


H 
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dexterity factor which overlaps into the first five mechanical and 
spatial tests, though not into the informational and interests tests. 
Wittenborn (1945) analysed these and other Minnesota figures by 
the centroid method, and his results tend to confirm these sug- 
gestions. He claims in addition a Maturational factor in dyna- 
mometer tests, height and weight; a Strength factor, mainly in 


dynamometer tests, and possibly a Steadiness factor, and Per- 
ceptual Speed. 


y 2 
TABLE X. SELECTED CORRELATIONS FROM THE MINNESOTA 
INVESTIGATION OF MECHANICAL ABILITY 


1. Quality of Shop 
Work 


2. Mirmesota As- 
sembly Test 


—— eee 
3. Minnesota Spat- 
ial Relations 
Formboard 
4. Paper Formboard| + . -63 
5. Stenquist Picture 
Test S tag *39 +30 
e l cei | 
6. Mechanical In- 


formation A2 S “40 +57 -34 
7. Mechanical In- 
terests 46 +39 -28 
«Home Mechani- 
cal Operations | -30 - *22 +24 -19 
aes 
9. Otis 1.Q, 


21 « “18 +53 -18 
10. Academic Grades] -42 - "26 -40 -28 


11. Packing Blocks 
12. Card Sorting 


tests and schooling. Doubtless thi 
(2) A spatial factor in paper-and. 
Counting and Minnesota F ormboard. 
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(3) An age and experience factor entering into some of the 
mechanical tests, but correlating with poor performance 
on some of the manual speed tests. 

(4) A manual dexterity factor especially prominent in Pinboard, 
Dotting, Assembling and Stripping Nuts and Bolts, and 
Packing Blocks. Half of the dexterity tests gave apprecia- 
ble loadings, and few showed significant loadings with 
the other four factors. 2 

(5) A factor which Harrell suggests is perceptual. But as it has 
loadings on Routine Assembling and Stripping, Mech- 
anical Assembly and Stenquist Picture tests, one would 
have thought that it approximates more to a mechanical 
factor than to non-verbal P. Pre 


Studies of the MacQuarrie Test. A number of investigations 
which throw some light on manual and higher general or group 
factors have been made with the MacQuarrie battery of paper-and- 
pencil tests of so-called mechanical aptitude. Bingham (1937) 
quotes correlations for employees of the Scovill Manufacturing 
Company between the battery as a whole and verbal and per- 
formance tests, which suggest, according to a group-factor analysis 
by the writer, that it has a g-loading of at least -6 and a smaller k- 
loading. Jorgensen (1934) has also found it to have high g-content. 
Harrell (1940) included it in the research just mentioned. The 
Copying, Location, Block-Counting and Pursuit tests were 
chiefly covered by his first two factors (g + %), while the Tracing, 
Tapping and Dotting tests depended on the first and fourth 
(g + manual). Goodman (1947) and Chapman (1948) factorized 


‘the inter-correlations among 329 radio assembly workers, and 


likewise established a spatial (or g + 4) factor, and a manual factor 
in the three latter tests. There were indications of a perceptual or 
visual inspection group factor in some of the tests. This is borne 
out by Murphy’s (1936) analysis of eighteen tests among 143 
14-year boys, which included Copying, Tracing and Tapping from 
MacQuarrie. Copying fell in a cluster with his spatial tests, but 
the other two formed a distinct group factor with Substitution and 
Checking tests from Army Beta, that is with tests akin to P. 
Conclusion. Although psychomotor abilities do not seem to be 
so devoid of structure as early experiments suggested, it is never- 
theless true that they are predominantly specific. Thus the notion 


104 The Structure of Human Abilities 


of dexterity or handiness as a general factor in manual occupa- 
tions, which can be measured by one or two pegboard, nut and bolt 
or similar tests, should be discouraged. The diagram in Fig. 4 
attempts to portray the main results reviewed above. ’ 

Physical Abilities. Jones and Seashore (1944) point out that 
greater generality is usually found among gross than among fine 
- muscular capacities, and that there is much more justification for a 
general athletic than for a general motor factor. Several analyses 
of physical measures and athletic tests have been published in the 
Research Quarterly of the American Association for Health and 
Physical Education (Wendler, 1938; Hall and Wittenborn, 1942; 
Brace, 1946, etc.).1 These are said to reveal factors of Strength, 
Speed, Agility, etc. Such trait-names, however, are apt to mislead 
(cf. Appendix, p. 134), and it would be better to describe the 
factors in terms of the performances they cover. McCloy (1940) 
claims that strength, speed of movement and dead weight factors 
consistently emerge in such studies. He quotes correlations be- 
tween six athletic measures, four dynamometer tests, and weight, 
among 163 junior high school girls. All correlations are positive 
and (except those for weight) fairly large, hence a group-factor 
analysis appears more appropriate than his rotated centroid 
analysis. Such an analysis by the writer. indicates: 


(1) A general physical factor in all measures except weight, with 
some 42 per cent. variance. 


(2) A ‘strength’ factor in dynamometer tests, weight, and shot- 
putting and ball-throwing. 

(3) A group factor in two jumping events and one running, 
weight being negatively loaded. The variance of Nos. 2 
and 3 amounts to some 26 per cent. 

A similar research by Highmore (1949) carried out with male 
physical training students gáve a general factor with 32-0 per cent. 
variance, and group factors with 11-7 per cent. for: 

(1) Running events. 

(2) Putting, throwing and kicking, 

(3) Standing and running, broad and high jumps. 


A study of nine athletic tests applied to 450 Army recruits was 
made during the war. When §, age, height and weight were held 
1 Other references are listed, and reviewed, by Highmore (1949), 
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constant, a general physical factor accounted for 27.4 per cent. of 
variance, and another 16-4 per cent. was covered by group factors 
which include: r 
(1) Running 100 yards, running 1 mile, walking 5 miles, Army 
Agility test. 

(2) Long jump, high jump, obstacle race. 

(3) Two types of chinning or pull-up tests. 

As in the case of psychomotor abilities, physical ones show some 
impregnation with g and k-m, though their group-factor content is 
much larger. The mean g-loading of the above nine measures was 
only -11 (variance 1.2 per cent.). However, in another investiga- 
tion of 578 Army recruits, thirteen tests or measures were analysed. 
After the extraction of a large g and a v-ed factor, there were small 
residual correlations, shown in Table XI. These clearly indicate 


TABLE XI. CORRELATIONS BETWEEN MECHANICAL AND 
PHYSICAL TESTS AFTER EXTRACTION OF G AND V:ED 


Med. 
loading Squ. Mech. Ass. | Cat. Youth Agility 


Test 


4 Squares Spatial 
2 Bennett Mechanical 
8 Mechanical Assembly 


Medical Category 
Youth (age reversed) 
16 Agility 

10 Morse Aptitude 


mechanical and physical group factors. The latter extends into the 
Morse Aptitude test and suggests a link between physical and 
` auditory abilities. In addition most of the physical measures corre- 
late slightly with the spatial-mechanical, particularly with the 
manipulative assembly test. The same figures have been analysed 
by Banks (1949), with similar results. Overlapping between 
physical and mechanical abilities is shown also by the discovery 
(cf. Vernon and Parry, 1949) that the Army Assembly test is usually 
more valid in predicting proficiency at jobs requiring physical 
effort, in Infantry, R.A.C. and R.E., than in predicting mechanical 
skill and trainability. 
We would expect to find a closer association between all these 
factors among low-grade adults or children than among high-grade. 
No investigation of physical tests along with manual, mechanical 
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and intelligence tests appears to have been carried out with a 
minded subjects, but a study of African recruits is re evan i 
Thirteen tests given to between 308 and 631 men are listec a 
Table XII. Most of these are simplified adaptations of Britis 

ones; the Fourth Corner is a performance test along the lines of 
individual Matrices. Three factors were extracted, but the third 
failed to yield any logical grouping, probably because of the varying 
numbers of cases, and the two listed in the Table gave a good fit. 
The first, general, factor hardly represents g, in the sense of 
educing relations, since it gives such large loadings to the simple 
formboards and dexterity tests. Probably it corresponds to a 
general adaptability to the unfamiliar testing situation. Mechanical 
Comprehension gets the poorest loading, perhaps because it is the 


TABLE XII. FACTORIZATION OF TESTS GIVEN TO AFRICAN 
RECRUITS 


Test 


Unrotated Factors 


Arithmetic 

Progressive Matrices (revised) 

Fourth Corner Test 

Block Design 

Cube Construction 

Mechanical Comprehension 
ormboard, circular insets 

Formboard, square insets 

Mechanical assembly 

Screwboard Dexterity ‘ 

Reversible Blocks Dexterity 

Pegboard Dexterity 

Agility 


most un-African and the least reliable test. The second factor 
clearly divides the tests into the primarily cognitive, including 
verbal, pictorial and performance, and the primarily manipulative 
and physical. The Formboards partake of bo 
so show the lowest loadin 
British g, while the op 
dexterity and physical fa 
speculations. Should 


SN 


= 


CHAPTER X 


PERFORMANCE TESTS AND MECHANICAL 
ABILITIES -° 
° e 
Abstract. Investigations of performance and mechanical tests 
tend to yield very diverse results owing to the effécts of back- 
ground, age, training, etc. Much work has been done in the 
Services, as well as among children. This shows that g usudily 
plays a large part, except in the more unreliable performance tests 
and in mechanical assembly tests. In general, performance tests 
measure the same g + k factors as paper-and-pencil spatial tests, 
while mechanical tests measure these factors and a mechanical ` 
information or experience group factor. Thus, apart from their 
greater attractiveness, practical or manipulative tests show little 
or no advantage over paper-and-pencil tests. Additional minor 
group factors are,indicated among special types of performance 
tests, and within the field of mechanical information. 
What Do Performance Tests Measure? Tests such as Form- 
boards, Picture Completion, Porteus Mazes, Kohs Blocks, and 
Cube Construction have been widely used for some thirty years in 
clinics and in vocational guidance (cf. Gaw, 1925), to measure a 
more ‘practical’ type of intelligence than the mainly verbal Binet 
„or group tests. It has often been assumed that they give some 
indication of aptitude for mechanical or manual jobs, or for tech- 
nical as contrasted with academic education. McFarlane (1925) 
found some overlapping, beyond that due to g, among boys but 
of practical tests including Cube Construc- 
a wheelbarrow out of wooden pieces, and 
the testee has to puzzle out an elaborate 
nings to open the box. Spearman, how- 
ever, considered that performance tests are merely rather unreli- 
able g tests, and Cattell (1936) reiterated this view. Kohs (1923) 
put forward his Block Design test purely as a measure of general 
intelligence. El Koussy’s (1935) survey of the literature (up to but 
not including Alexander’s work) also showed that the weight of 


not girls in a number 
tion, the assembly of 
Healy’s puzzle box where 
system of latches and faste 
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evidence was against any single or clear-cut practical factor in all 
performance tests, at least among children. Neither Pintner- 
Paterson’s, Gaw’s, Drever-Collins’s, nor Arthur’s standard batter- 
ies of tests appear to have been analysed thoroughly, presumably 
because of the time taken to test sufficient numbers. Kelley (1928) 
quotes correlations for small groups of 13-year boys and girls 
between eight of Gaw’s tests, but as he chose the ones showing 
most independence it is not surprising that they failed to yield any 
l6gical pattern. Schiller (1934) included three of the poorer Pint- 
ner-Paterson tests, also Drawing a Man and the non-verbal Army 
Beta and Otis Primary group tests in an investigation of twelve 
tests among 395 nine-year boys and girls. A group-factor analysis 
by the writer indicates that the Pintner-Paterson total scores and 
Drawing a Man have §-variance of about 20 per cent. and k- 
variance of 10 per cent. or less, i.e. very high specificity, the same 
factors being present in the non-verbal group tests. 

Factorial Studies of Performance Tests. Morris (1939) gave 
the Pinter-Paterson tests and a series of mani 
what selected group of fifty-five boys aged 
that many of the performance test inter- 
Sometimes even negative, that they cann 
same ability, and claims that the three r 
extracted correspond to Thurstone’s Sp 


ual tests to a some- 
94. He points out 
correlations are so low, 
ot all be measuring the 
otated factors which he 
ace, Induction and Per- 


€a moderate test of g, while its six easy 
rent ability—probably a kind of F 
ual dexterity. wS Se es 


= 
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Thomson (1940) analysed the performance tests used in the 
Scottish individual test survey of cleven-year-olds and obtained a 
general factor, present also in Stanford-Binet, together with some 
indication of a separate group factor in the speeded tests. But as 
the battery only contained one g or g + v test, a distinct perform- 
ance factor could not be expected. Burt (1940b), however, showed 
that additional minor factors were present in the performance 
tests, one for speeded tests, one for pictorial dnd a third ‘linguistic’ 
factor in tests with elaborate verbal instructions such as Knox, 
Cube, Kohs, Cube Construction and Binet. = 

Earle and Milner (1929) applied numerous tests ‘to over 300 
13-14-year children, and found that they could be classified into 
three main groups: ya 

(1) Stanford-Binet and Group Test 34—g + v tests with high 

loadings. | P l 7 
(2) Cube Construction, Dearborn Formboard, Form Relations 
and Memory for Designs—performance} and paper- 
and-pencil tests—also Stenquist Assembly. These were 
partly dependent on g, but showed additional overlap 
suggesting a spatial-practical-mechanical group factor. 

(3) Other performance tests including Cube Imitation, Substi- 

tution, Picture Completion II and Porteus Mazes—g 
tests with low saturations and no other common factors. 
The mean correlations within and between these groups 
are shown in Table XIII. 


TABLE XIII. EARLE AND MILNER’S CORRELATIONS BETWEEN 
DIFFERENT TYPES OF TESTS 


-38 . 3 


46) 
per 


That the g-loading of performance tests changes with age is 
suggested by Arthur’s (1930) correlations of -81 at six years and 
-26 at 14 years between her battery and Stanford-Binet. We would 
suggest however, that this is merely due to most of her tests 
approaching their ceiling among older children, and to the increas- 
ing verbality of the Stanford-Binet. More appropriate performance 
tests do continue to provide a moderate measure of intelligence in 
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adolescents and adults. Even in an extremely high-grade group of 
255 Civil Service candidates, the writer found a g-saturation of 
about -60 for the Kohs-Misselbrook Block test. Balinsky’s (1941) 
results indicating considerable general factor saturations for the 
Wechsler-Bellevue performance tests from 9 to 60 years, and 
doubtful alterations with age, have already been mentioned (p. 29). 
Alexander’s F and the k-Factor. Alexander’s investigation 
(p. 18) proved that some performance tests do measure a factor 
- beyond g. In his adult group, Picture Cempletion and Formboard 
tests actually obtained larger loadings on his F (practical) factor 
than did the three tests chosen for his performance scale, and 
Porteus Mazes much the same loadings. The three younger 
groups took three of the pictorial tests which Cox devised for 
Measuring m (p. 101). In the youngest groups they appeared to 
acf mainly as g tests, but among the older youths they approxi- 
mated quite closely to the F tests. Possibly because of their com- 
. Plicated instructions they also showed small verbal loadings. These 
results, and others quoted below, Suggest that F is identical with 
the spatial factor k, and that Cox’s m is largely composed of k also. 
The g-content of performance tests such as Alexander’s is a 
Matter of considerable importance in attempting to differentiate 
academic’ and ‘practical’ types of children at 11+. Burt (1947) 
Suggests that the correlation between reliable intelligence and 
performance tests is about “6, making differentiation very difficult. 
Alexander (1947) claims that it j 
published in the Instruction Book o; 


(1) g. 


(2) a verbal group factor in the Moray House 


test, Spearman’s 
v test and teachers’ verbal ability assess i 


ments. 


as 


I 
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(3) a spatial factor in the Alexander tests and Spearman’s k test, 
possibly also in a non-verbal g test and teachers’ assess- 
ments of practical ability. 

Drew’s results for eighty-eight technical school boys (16+) 
are more complex because the boys were clearly selected for g and 
v:ed, and because of the unreliability of the coefficients. But the 
writer’s re-analysis yields: x 

(1) a k factor in the Alexander tests, a formboard, a spatial test 
eand a non-verbal’g test. This enters too into Technical ° 
Drawing and Shopwork marks. F 

(2) a verbal factor in Simplex and the v tests, and in marks for 
English and Science. ; A 

(3) A separate scholastic or X factor in all four sets of marks, 


Emmett (1949) also re-analysed some of Drew’s figures, and 
found a common factor in Alexander’s battery, a k test and a non- 
verbal g test. Williams (1948) included the Alexander scale in an 
analysis of verbal, mechanical, spatial and non-verbal g tests among ~ 
250 twelve-year boys. The v, m and k tests gave distinctive group 
factors, and the Alexander tests fell in the same cluster as the k 
ones. There was no trace of a performance factor distinct from 
paper-and-pencil abilities. Dempster (1948) reports similar 
results with ninety-one 11-year boys,, though he quotes no 


Price (1940) worked with only eighty-five University 


figures. 
He gave three verbal and non- 


students, but his results are similar. 
verbal g tests, three spatial tests, Kohs Blocks, Passalong, Dear- 
born Formboard, Cylinder Construction and Woolley’s (Blind- 
fold) Formboard. After extracting a general factor, a bipolar 
s found which separated the g tests, Passalong and 
spatial and the other performance tests. Pre- 
long and Woolley are the poorest measures of 
-grade adults. The residuals, which are 
statistically insignificant, do provide a very faint trace of a dis- 
tinction between paper-and-pencil and manipulative tests. 
The only relevant evidence collected in the Services was in a 
group of 500 air mechanics, who took the Trist-Misselbrook 
revision of Kohs Blocks. This obtained almost identical loadings 
onthree unrotated factors to those of the Squares spatial group test. 
Conclusion. Before proceeding to mechanical tests it would be 
well to summarize the position. Many of the commonly used per- 


factor wa 
Woolley from the 
sumably then Passa! 
the spatial factor in high 
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formance tests are unreliable, or embody large specific factors, 
hence it is unsafe to use any battery or scale as a measure of g. The 
correlation of such a battery with Binet or group verbal tests is 
further reduced by the considerable v-element in such tests. 
Nevertheless, it is likely that g is the major common factor in all 
except the simplest manipulative performance tests, hence it is 
impossible to differentiate effectively any large proportion of 
children, at 11+, ifto academic and practical types. Most of 
_ the more thorough performance tests, Kohs Blocks and Cube 
Construction in particular, bring in a fair amount of the & factor 
which is measured by spatial group tests, and which is closely 
linked with the m or mechanical factor. Though there is little 
direct evidence, it is probable that some of the simpler ones depend 
too on a manual dexterity factor or factors. If large enough num- 
bers of subjects are tested with a varied battery, minor group 
factors begin to emerge among different types of test—pictorial, 
speeded, etc. But there are no grounds for assuming a broad per- 
formance or practical factor distinct from g, k, m and dexterity. 
Hence, apart from the dexterity element and the greater attractive- 
ness of performance tests to testees, there is no reason why psycho- 
logists should not substitute the more reliable and convenient 
paper-and-pencil mechanicaland spatial tests for performance tests. 


_ Mechanical Tests. Murphy (1936) suggested that tests claim- 
Ing to measure mechanical ability are so 


tions factor and a hand-eye co- 
all their inter-correlations, 


tests that do involve comprehension 
hanisms, none of which was included 
ce that a large portion of what is com- 


tests. Indeed Williams’s research 
revised Bennett Comprehensi 


one to show any clear differentiation, 
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Mechanical Information Constitutes a Distinctive Factor. 
In several investigations in the Services, Test 2 Mechanical 
Comprehension and Test 4 Squares Spatial obtained much the 
same factorial composition (cf. Table V). But when additional 
spatial and mechanical comprehension or information tests were 
analysed, as in Table IX, a separation emerged between these two 
types. Table VII similarly indicates partially distinct k and 
mechanical information group factors. The Uifferent factor load- 
ings of Mec-B, a picture ¢est based on comprehension of the works 
ings of machines and Mec-C, a straightforward information test, 


TABLE XIV. ALTERNATIVE GROUPINGS OF THIRTEEN 
TESTS ANALYSED AMONG 283 R.A.F. FITTERS « a 


Gen-A Verbal 
e 


G-5 R.A.F, Matrices 


Group Test 80 Spatial 
K-6 Spatial 


Vincent Models 
Finger Dexterity 
Mec-A Bennett Mechanical 


“(SP 103 Wirebending 


Mec-B Mechanical Diagrams 
Mec-C Mechanical Information 
117M Mechanical Information 
117E Electrical Information 


are noticeable. Guilford (1948ab) claims indeed that the only 
distinctive element in any mechanical tests, apart from spatial, 
‘dexterity and other factors, is one of information. The difference 
between k and information is further underlined by the rather 
disappointing results obtained with & tests in selecting mechanics 
in the British Services, and the comparatively good results ob- 
tained with information tests (Vernon, 1947b). This may of course 
be due partly to the inverse effects of age on the two types of test 


(cf. p. 33). 


Analyses of Mechanical Tests. The complexity of over- 


lapping between g, k, m, and dexterity, and the extent to which the 
predilection of the factorist may affect the grouping, is neatly 
illustrated by a research in the R.A.F. by Wheeler (1948). The 
tests listed in Table XIV were given to 283 airframe fitters, that is 
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a group with some previous mechanical training. Several factorial 
techniques were applied and the tests fell equally well into the 
four groups or the three groups shown here. In the four-group 
solution, G-5 Matrices goes with the k tests, information tests are 
distinct, and Bennett Mechanical Comprehension + two mani- 
pulative tests yield a separate mechanical factor, But the alter- 
native analysis groups the k tests (excluding G-5) with mechanical 
comprehension, and contrasts them with information -+ Wire- 
< bending; while G-5 goes with the g + tests. In this study the 
general factor (a mixture of g and k:m) covered about 28 per cent., 
and the combined group factors 20 to 22 per cent. of variance. 
The American naval classification battery of twelve tests was 
faétorized by Peterson (1943) to yield three factors called Verbal, 
Mechanical-Spatial and Quantitative Reasoning. This solution 
Was unconvincing, and a re-analysis by the writer gave a more 
logical picture which is entirely congruent with British results. In 
addition to a prominent &, a verbal group factor appears in Reading 
Comprehension, Opposites, Analogies, Completion and Arith. 
metic, and a k:m factor in the remaining seven tests. "Two minor 
group factors are needed, one in the four 
information measures, and on 
Comprehension and Surface D 
Mechanical Information 
in the information field was s 
six tests among 136 naval 
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were to measure more specialized knowledge of different branches 
of engineering, we should doubtless find further group factors 
separating, in the same manner as the secondary school or univer- 
sity subjects mentioned in Chapter IV. 

Manipulative Tests. Assembly or other manipulative me- 
chanical tests tend to give very high correlations with the k:m com- 
plex. In five analyses which included Bennett and Squares paper- 
and-pencil tests, and Army Assembly, Meccano Assembly or 
Wirebending, the mean loadings on k:m of the former were «35, 
and of the latter, practical, tests -76. ‘Their respective mean g- 
loadings were +59 and +25. It seems likely therefore that, just as 
the F of performance tests is covered by, so assembly tests involve 
no factor which is not measurable by paper-and-pencil tests »fck 
and information. We would allow that assembly tests, like some 
performance tests, may bring in a small dexterity component. 
The Minnesota figures (Table X) suggest that if there is any addi- 
tional mechanical manipulative factor among boys, it is quite 
small. Thurstone (1948) is engaged on an extensive research into 
the structure of mechanical ability, and his findings on this point 


will be awaited with interest. 
One relevant analysis was made among 130 Army recruits under 
training as driver mechanics. Table XV shows the unrotated 


TABLE XV. CENTROID AND GROUP FACTORS AMONG 
MECHANICAL TESTS APPIIED TO ARMY DRIVER MECHANICS 


Centroid Factors General 
pe 4 (unrotated) g+k:m | Group Factors 
I II III 


s 2 Bennett Mechanical .654 154 +318 -662 
4 Squares Spatial” +539 -258 089| °521 
Meccano Assembly -876 141 —'266 +924 


Wiring Dexterity +551 -291 —°155 | -531 
Meca] Information | +724 —'280 4 -116 -631 
Mechanical Interests +232 —+298 —:053 +136 
Course Marks -795 —+256 —:085 | *707 


Variance per cent. 42°8 61 3:2 


Thurstone factors, and the group factors extracted by Burt’s 
technique. The Wiring test is a complex manual test of joining 
wires to terminals. Mechanical interests were measured by a test 
of the Strong type. The general factor is, of course, a mixture of g 
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and k-m, but there appear to be minor group factors, of doubtful 
significance, differentiating paper-and-pencil tests, manipulative 
tests, and measures of interest and information (the latter including 
the assessments of proficiency at the end of the course). 

The finding that paper-and-pencil tests will (provided allow- 
ance is made for their higher g-loading) cover almost the same 
ground as practical performance or mechanical tests is of great 
importance to vocational psychologists. Practicai tests naturally 
possess greater ‘face’ validity and may be justifiable on the grounds 
of their attractiveness to candidates and acceptability to employers. 
Yet it is a waste of time and money to use them if their statistical 
validity is not superior. In the investigation just described, the two 
manipulative tests combined correlate -674 with the driver 
mechanics’ course results, and the four pencil-and-paper tests 
correlate -693. 

Many psychologists, including the writer, like performance, 
assembly, or other practical tests not so much because of the pre- 
dictive value of their Scores, as because the testee’s method of 
tackling the problems, his interest and Concentration, etc., appear 
to give valuable qualitative clues (cf. e.g. Oakley and Macrae, 
1937). Here we have the same Situation as with the qualitative 
tests that clinical psychologists apply to neuropsychiatric patients 

ll be ‘something in it’, but if so, the 
tests, over and above the actual test 


Centre tests, and an earlie 
ence, arithmetic and dic: 
mathematical and dictatio 


t Recruiting Centre battery of intellig- 
tation tests. Table XVI shows clear 


matics and the comprehension + spatial tests. 
In another experiment parallel analy. 
given to groups of 240 15-year and 18 
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TABLE XVI. ROTATED CENTROID FACTORS AMONG TESTS 


APPLIED TO 312 NAVAL AIR MECHANICS. (Loadings less than -10 
omitted). 


Test Tech. 
Educ. Arith: Dict. 


SP Test 1, Abstraction 
RD Test, Abstraction 
Recruiting Centre Verbal Intelligence 


SP Test 4, Squares Spatial + 

SP2, Bennett Mechanical Comprehension 

RA Test, Mechanical Comprehension and 
Information 


SP3a, Arithmetic 

SP3b, Mathematics 
Recruiting Centre Arithmetic 
RB Test, Mathematics 


SP Test 74, Dictation 
Recruiting Centre Dictation 
RC Test, Spelling 


Variance per cent. 


15 year group 18 year group 
Key to the Tests 
1—Abstraction. Cox—Cox Mechanical Models. 4—Squares Spatial. 
2—Bennett Mechanical. ` M—Mechanical Information. 97—Memory for Designs, 
36—Mathematics. E—Electrical Information, WB—Wirebending. 


Fig. 6. Graphs of Factor Loadings in 15-year and 18-year 
Artificer Apprentices 
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nger group chiefly contrasts educational tests like 3b 
Sees rine oh ae Mand E. In the older group this 
factor has almost vanished and the main distinction, running from 
R to L instead of from top to bottom, is between technical ie 
tion including mathematics and information on the one hand, an 
practical ability, chiefly represented by Wirebending, on the other. 
The technical education factor is presumably the same as that of 
Table XVI. The spatial tests (4 and 97), Abstraction Test 1 and 
‘Mechanical Comprehension Test 2 stay in much the same posi- 
tions in the two graphs; but 3b, M and E move towards one 
another, and Wirebending moves outwards—in other words 
manipulative skill becomes more distinctive. Cox Models is a 
comprehension test at the younger age, but apparently becomes a 
more practical one later. Fig. 6 omits the first factor loadings of 
the tests, since they were closely similar in the two age groups. 
There was no evidence of greater differentiation of abilities, or 
lowered influence of g with age (cf. p. 30). 
Another instance of alterations in structu 
and/or with level of ability is offered by the a 
to African recruits (p. 106). The results were 
obtained in this country, but showed close: 
chanical, manual and physical abilities on t 
intellectual (g) and educational abilities on t 
ther differences have been found in this c 
and women. 
Analyses Among Women. Six main analyses were carried out 


on A.T.S. tests, three with representative populations of women 
recruits, one with a lower grad 
grade groups of motor mechani 

two latter the results were quite similar to those of men, except for 
the restriction in variance o. 


fg, due to high selectivity. Table XVII 
shows the group-factor anal 


ysis loadings for special operators. 
Note that Spelling (SP 14) and Educational Standard have 
especially low g-saturation: 


Matrices, measure a k:m fa 
Most interesting finding 
Models test. The k:m 


-44, Squares -47, Assembly +75, Meccano Assembly -64, Vincent 
Models -17. 


re with experience 
alysis of tests given 
congruent with those 
r integration of me- 
he one hand, and of 
he other hand. Fur- 
ountry between men 


Much greater difficulty was experienced in arriving at acceptable 
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factorial solutions among normal women (cf. Table XVII) and 
cooks. Not only did the Clerical and Spelling tests appear to 
obtain unduly high g-loadings, but Bennett Test 2 had extremely 
low g and k:m; presumably it measures a very largely specific 
factor in women. Further the Matrices test could not be differ- 
entiated from the k:m tests. Note the similarity of this result with 
a non-verbal g test to the results obtained by Slater among 
11-13-year children, and to those of Melfone and McFarlane 
(pp. 68, 107). Either we must admit then that tests such a$ 
Matrices do depend largely on k:m in women—and this is the 
solution adopted in Table XVII—or that the k:m factor has hardly 


TABLE XVII. _GROUP-FACTOR ANALYSES AMONG 200 REPRE- 
SENTATIVE A.T.S. RECRUITS AND 200 SPECIAL OPERATORS 
DŘ ES 


Test Normal Group Special Operators 
g vied k:m h? g vied k:m h? 
Progressive Matrices | +75 $320 || 67, 68 “00 
2 Bennett Mechanical | +41 229) +26 "43 eh q 
4 Squares Spatial -53 *51 "55 38 -50 -39 
24 Meccano Assembly | +43 -55 -49 — = 
14 Spelling "64-51 67 "16°74 Mi se 
23 Arithmetic +68 "34 -58 °49 -51 -50 
12 Clerical -81 -26 “72 +38 52 “41 
17, 25 Verbal *66) "93: “71 °S2 42 “44 
Educational Standard | +57 +45 +53 Pek] a 32 
Variance Per cent. | 38:8 16-2 8-4 | 57:4 | 18:9 18-9 4:3 | 42:2 


differentiated in Test 2 and only partially in Test 4 Squares and 
24 Assembly. The second view would not only mean that spatial- 
mechanical tests measure hardly anything but g, but also that the 
v-ed tests would obtain smaller g-loadings than in men and larger 
ved loadings. A factorial investigation by Banks (1949) of large 
groups of Army and A.T.S. recruits confirms the first solution. 
Probably the main reason for the difference between men and 
average women is that the SP non-verbal tests give an even less 
adequate sampling of women’s practical abilities than they do of 
men’s. Had there been tests of, say, domestic and social abilities 
to constitute a contrasting pole with the v:ed tests, a more accept- 
able patterning of abilities might have been found. 
Conclusions. The evidence of this and the preceding chapter 
indicates that we can hardly expect to discover a structure within 
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the k-m complex as definite as that existing in the verbal-numerical 
field, since so much depends on the experience, training and back- 
ground of the testees, and on the extent to which they are selected. 
Psychologists recognize nowadays that muscular development in 
the infant is neither differentiation of specific skills from general- 
ized mass movements, nor the integration of elementary reflexes 
into complex habits, but a combination of both. Similarly we may 
expect group factors in mechanical ability to differentiate when 
Specialized functions are practised, and to reintegrate into fresh 
patterns. Since considerable growth of k:m occurs in the average 
male adolescent (cf. Vernon and Parry, 1949), it is probably 
accompanied by alterations of structure, depending on the skills 
trained in his employment or hobbies. And as any group of sub- 
jects large enough to yield significant factors is likely to be ex- 
tremely heterogeneous in experience and training, such factors 
cannot be very clear-cut and cannot reveal much information 
about the abilities of any particular individual, Vernon and Parry 
(1949) point out also that k and m tests are considerably more 
useful in predicting trainability for mechanical jobs at 13-15 years, 
and among women, than among male adults. This too would be 
expected if k:m tends to differentiate and to become more com- 
plex in structure in men after 15: 

We should not expect investigations of the kind summarized 
above to throw much light on the problem of whether or not 
mechanical aptitude is innate. It would be foolish to deny the 


such as musical clearly possess an 
ld recognize that most of what the 


CHAPTER XI 
OCCUPATIONAL ABILITIES 


Abstract. A prominent general factor of aptitude for all joh$ 
appears to be compounded of g, v-ed, and a drive component 
closely related to X. Beyond this, job abilities group into the more 
bookish and the more practical, and possibly other major types. 
But the scope of these group factors is probably small comparéd 
with that of minor group or specific factors. Hence transfer of 
ability from one specialized type of work to another is limited. 
Also the overlapping between factors established among tests and 
job factors is disappointing, so that vocational guidance (as dis- 
tinct from selection) cannot hope to advance very far merely by 
trying to predict aptitude for general types of work gon scores 
on a battery of tests. 

Trainability—General or Specific. Itis much more difficult 
to map out the main factors in vocational than in educational 
abilities, for the obvious reason that people do not work at several 
jobs simultaneously as they do at several subjects of study. More- 
over, the persons in any single job tend to be more selected, either 
for intelligence or trade experience, than school pupils. However, 
several lines of investigation have been partially explored. 

During the war it was possible to observe the trainability of men 
and women from different civilian occupations in different Service 
jobs. The importance of previous direct experience, i.e. of small 
group or specific job factors, was eyident among drivtrs, tele- 
graphists, clerks, and many types of tradesmen (cf. Vernon and 
Parry, 1949). For example, the correlation between previous 
driving experience and success at Army driving was higher than 
that of any battery of psychological tests. It is fair to state also that 
transfer effects were very limited, i.e. that experience in other 
superficially similar jobs was often of little or.no value, though 
unfortunately no really reliable quantitative data are available. 
Thus machinists and fitters’ and turners’ mates did not do any 
better as naval ordnance, or engine room mechanics than did 
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men from non-engineering occupations such as retail tradesmen, 
and no type of mechanical background appeared to assist radio 
or electrical mechanics. Much more influential—though this is 
difficult to prove—was a kind of general factor comprising in- 
telligence and educational level and personality traits such as 
keenness, drive, seriousness of purpose. For example, clerks and 
policemen who tended to be high in such a factor did well at almost 
any job to which they were allocated, and usually surpassed men 
with a more practical background, Again it often happened that 
recruits who failed in one Service job had to be reallocated, and it 
was usually found necessary to move them to a job requiring lower 
general ability and application. If they were transferred to 
another job at the same level, in which they claimed some interest 
or experience, only too often they failed again. The layman’s 
notion that there exists a niche or special type of work ideally 
suited to the specialized aptitudes of each individu: 
be much less true than the vi 
employees fall along a single high-grade to low-grade continuum. 
The success of women workers 
the war further supports this vi 
recently by Philpott (1947), 
was in most instances one of rapid trainability, and different re- 
sults might have followed ha 
able. But so far as our rathe 


provided a most useful ind 
(Lummis, 1946; Vernon a 
lations were obtained betw. 
Proficiency in the Services, 

Common Factors in Job Anal 


yses. A second approach is 
that of Coombs and Satter (1949), 


who had fifty-four jobs in a 
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large paper mill assessed by job analysis experts for the skills, 
knowledge, etc., required. The twenty jobs were then taken which 
showed fewest common elements, namely thirty-five to seventy- 
four out of a possible 104 elements, and correlations were derived 
from the numbers of elements and factorized. After rotation a 
clear-cut general + group factor pattern was obtained. The 
general factor, covering 43-4 per cent. of variance, represented 
common features or requirements in all types of job. Four 
‘families’ of jobs, with combined variance 17-0 per cent., included: 


A. Self-responsible jobs, e.g. employment interviewer, librarian 
supervisor. 

B. Routine entry occupations, e.g. messenger, receptionist, 
nurse, shop clerk. E 

C. Skilled manual, e.g. multigraph, multilith and key punch 
operators. d 

D. Clerical, 


The method might fruitfully be extended to a wider range of 
jobs, and would no doubt yield additional types. The authors 
point out that the size of the general factor largely depends on the 
skill and objectivity with which the assessments are made. But as 
especial care was taken in this study to avoid subjectivity, and as 
only the most independent jobs were factorized, its large variance 
(relative to the group factors) confirms the conclusions of the 
preceding paragraph. : 

Analyses of Training Marks. Obviously it would be better to 
analyse the actual skills of groups of workers, and useful evidence 
was collected in the Services when trainees were marked or rated 
for several different sections or aspects of an occupation. The 
major differentiation between v-ed and k:m types of work recurred 
as constantly as it does in the fields of education and of psycho- 
logical tests. Thus among naval, Atmy and A.T.S. signallers and 
telegraphists, ability at theory or bookwork is partially distinct 
from ability at Morse. Similarly, informational attainments among 
clerks contrast with typewriting and stenography marks, technical 
acquirements among R.N.V.R. officer cadets with personality 
qualities, and so on. Table XVIII shows three (unrotated) factors 
extracted from eight sets of marks awarded to 250 naval engine 
room mechanics. A fairly prominent general factor runs through 
all the marks, but the bipolars indicate two group factors: 
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(1) Written general paper and electricity—predominantly theor- 
retical subjects. 


(2) Sheet metal work and fitting. 
The remaining four subjects all give different factor patterns. 


TABLE XVIII. CENTROID FACTORS IN THE COURSE MARKS 
OF ENGINE ROOM MECHANIC TRAINEES 


Subject 


© | Written theory paper "3% —-31 —-49 
‘| Electrical Equipment and Wiring "56 —-56 —-07 


eer eS .,| 
Sheet Metal Work -63 34 “08 £53 
Precision fitting *66 "39 —-05 Re!) 
Gn ne eee Ey 
Garage “77 —:09 —-04 62 
Engines 


“74 —10 +47: e77 
Oxy-acetylene welding 49 “08 = 
Centre Lathe 


Variance Per cent. 


, for example, yielded distinctive 
group factors for theory or bookw 


transmitting, and for visual signalling, 
men and electrical mechani 
specificity, only 38 per cent. of 
because the duties in which th 
diversified, Among electrical m 
factors representing: 

(1) Mining and Workshop. 

(2) Gyro Compass and Whitehead (Torpedo). 


(3) Schoolwork, Preliminary Electricity and Low Power 
Electrical Equipment, 


Analysis of Objective Measures of Workshop Ability. Now 
all training marks, even when awarded by independent examiners, 
are liable to be influenced by the examiners’ opinions of the 


trainees as individuals, their industriousness, alertness, etc. 
When objectively marked iti 


1s considerably reduced. A 
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the six months’ workshop training and theory course, most of them 
being based on standard test pieces in fitting, turning, etc. While 
seven sets of theory examination marks gave unrotated factors 
with variances 46-1, 8-1 and 4-1 (total 58-4 per cent.), the average 
result for three analyses of objective practical marks was 28-6, 
10-6, 4-7 (total 44-1 per cent.). If the bipolar factors were con- 
verted into group factors, we should probably find that the group 
factors, corresponding to different types of job, are more prominent 
than general workshop ability. A specimen analysis of fitting jops 
is shown in Table XIX. The second factor differentiates the first 
month’s practice jobs from the second month’s jobs and tests. 


TABLE XIX. CENTROID FACTORS IN WORKSHOP PROFICIENCY 
MEASURES AMONG 122 NAVAL ELECTRICAL MECHANICS 


Marks 


Job No. 5478 

5848 

5001 

5050 

5867 

» m» 5838 
Practical Test 1 
2 


Variance per cent. 


The third differentiates Nos. 5478 and 5848, which involve angles 
other than right angles, from Nos. 5001 and 5050 which involve 
right angles only. No. 5848 which brings in the greatest number 
of different angles has the highest general factor loading and 
cémmunality. The third factor also separates the second month’s 
precision jobs from the tests which were done under examination 
conditions without any advice from, instructors. Thus different 
conditions of work as well as different types of job affect the cor- 
relations. Turning, shaping, thread-cutting, and other operations 
similarly yielded partially distinct factors in other analyses. The 
contrast between theory and workshop analyses is even more 
striking when it is pointed out that the trainees were extremely 
highly selected for intelligence, mathematical and electrical know- 
ledge, whereas they were unselected (apart from good Test 2 and 
4 scores) on the practical side, and were drawn from a great 
variety of—mostly non-mechanical—occupations. 


126 The Structure of Human Abilities 


The conclusion follows, then, that though a general ability at, 
or aptitude for, mechanical work does exist, quite apart from g, it 
is of small extent when objectively assessed, and factors specific 
to the particular type of operation or machine are more important. 
Each test job, occupying the trainees several days, may be aptly 
compared to a single mathematical test item, in reliability and in 
overlapping with other jobs or items, Hence, a very extensive' 
sampling of jobs ovef months would be needed to yield a reliable 
objective criterion of workshop ability, although doubtless an 
experienced instructor could arrive at a fairly reliable—ye® more 
subjective—estimate in a briefer time. 

The only comparable study in the literature is that of the 
Mianesota investigators, who unfortunately do not report the 


numerous psychological tests. 
Wheeler (1948) among R.A.F, 
marks alone yielded a general 
bipolars with 15-5 per cent. of 
the marks awarded under exami 
during the course, also the mo: 
tical marks. So far this duplicates our results quoted above for 
and electrical mechanics, When the marks were 
hological tests (cf. Table XIV) the 


bipolar contrasted the theory 
ts. 


_ Numerous follow-up studies (cf. Vernon and Parry, 1949) have 
similarly shown better validity for ved tests in work of a clerical, 


verbal or theoretical nature; k:m tests have obtained relatively 
of a practical nature, though ved tests 
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structure of job abilities appears to parallel closely that of educa- 
tional abilities, and may be roughly represented as in Fig. 7. 
G, X and v-ed together make up general occupational ability; this 
may be partially split into verbal, mechanical-practical and possibly 
other types (e.g. managerial, dealing with people, etc.), which 
themselves sub-divide indefinitely. As in education again the 
general factor and small group or specific factors cover much more 
variance than the major pete 


General Oceepaeconia Ability 


Pe 
Verbal Other Major Mechanical 
Type Job Families Type 


| || 


Minor and more specialised types of jobs 


BPRS EES 


Specific Job Abilities 
Fig. 7. Diagram illustrating the Structure of Occupational Abilities 


“We have no sure information as to whether jobs group in accord- 
ance with the other main test factors described in preceding 
chapters. Nevertheless, it is obvious that some jobs require much 
stronger physique than other, so that our physical factor may link 
up with these. In spite of the low validity and specificity of manual 
dexterity tests, they do often yield moderate correlations with 
machine operating and assembly jobs. The Division of Occupa- 
tional Analysis claims that its non-verbal perceptual factor is 
especially relevant to visual inspection jobs in industry, but 
provides no evidence of validity for this, nor for the three psycho- 
motor, factors. It is interesting, none the less, that the Matrices 
test and Group Test 70 (although apparently little dependent 
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kind of practical efficiency in adults (cf. Vernon, 1947b). The 


re highly correlated 
with success at mechanical jobs in general, also specialized trade 


, per- 
ceptual, manual, or by rote memory, attention and other types of 
test.than we do already with g, v:ed and k:m tests. Certainly it is 
worthwhile exploring the relevance of the established test factors 
for groups of jobs. But it appears unlikely that vocational guid- 


this is the aim of the Division of Occupational 
many American psychologists. In order to tap 
factors, we should almost certainly have to resor 
ate, expensive and time- 
have their place in vocati 
for guidance, Psychologis 


Motivation or X 4 


and his more specific 
der Consideration. 


APPENDIX 


GENERAL -+ GROUP FACTOR VS. MULTIPLE FACTOR 
„ THEORIES |: 


IVERGENCES between the methods and cenclusions of 
[iia and American investigators in the field of factor 

analysis are much less acute now than they were ten gears 
ago. But there is still sufficient difference in their views on mental 
structure to make it essential to show why general + group-façtor 
solutions are considered superior in this book. 

Factorial Techniques. It should be pointed out first that 
Thurstone’s centroid technique, being perhaps the simplest to 
apply, is very widely used in Britain. The main difference is that 
we either do not rotate the resulting factors, but use them to 
indicate what group factors are present before starting a true group- 
factor analysis, or else we rotate in such a way as to maximize 
rather than minimize a general factor. Conversely in America, 
Holzinger has always favoured group-factor techniques, and R. B. 
Cattell (1946) regards the hierarchical picture of abilities as 
superior to the multiple factor. There are a few minor differences 
of technique. Some British workers prefer Burt’s original Simple 
Summation to the Centroid method, though the latter has a neater 
device for reflecting signs when extracting bipolar factors. Usually 
also we guess the communialities for insertion in the diagonal cells 
and repeat the analysis several times until the guesses approximate 
to the correct values, instead of—liké Thurstone and his followers 
—inserting the highest correlation in each column at each stage. 
Thurstone’s short-cut is liable to exaggerate the communialities, 
and therefore the size and number of the later factors (cf. Burt, 
1938). But Burt’s more accurate successive approximation tech- 
nique is hardly practicable when the number of tests is large, say 
fifteen or more, or the number of factors is large. 

Number of Statistically Significant Factors. A more im- 
portant difference between the methods commonly used by fac- 
torists in the two countries is that British writers often stop at two 
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factors (or g + two group factors) and seldom exceed four, whereas 
Americans seldom extract less than five and often exceed twelve. 
Both agree that factors should satisfy a criterion of statistical 
significance, but American criteria seem to us so lax that many of 
the later factors they obtain from smallish populations must be 
considerably distorted by chance errors. Spearman (1939) 
strongly criticized Thurstone’s original primary factor study on 
these grounds. Unfortunately centroid analysis (unlike Maximum 
Likelihood, Principal Components and some other techniques) is 
mathematically, a rather rough and approximate technique, 
hence its exdct sampling errors are unknown. It seems probable 
that the chi-squared tests advocated by Burt (1940) are unduly 
strict, and that the checks popularly used in America, namely, 
Tucker's and Coombs’s, are too lax. But Burt and Banks’s (1947) 
most recent formula for the Standard Error of single loadings,* 
and McNemar’s (1942a) logically derived formula for the signifi- 
cance of residuals, have both been checked empirically and, in the 
writer’s experience, yield much the same results. Guilford and 
Lacey’s criterion is the simplest,” and though much more rough, 
it too agrees in showing that most British factorists have in the 
past been too cautious, most American ones too indulgent. 
Disagreement Regarding G. There is thus no essential dis- 
agreement on mathematical points, and provided that group-and- 
multiple-factor analyses account equally well for the original 
correlations by means of the same limited number of factors, they 
are equally legitimate. Moreover, many American factorists do, 
like British ones, find a g-factor nowadays, as was shown in 
Chapter II. But it still remains true that British writers make g 
as large as possible, and posit group factors only when the ré8i- 
duals necessitate them, whereas Americans either introduce gasa 
second order factor, or, if a, primary one is unavoidable, tend to 
minimize it. Again British workers recognize larger or more com- 
prehensive group factors together with sub-factors ‘descended’ 


f : a) Vn 
S.E. of a loading, r= VNG ay where n=number of tests, and 
a = 5 
The writer would suggest that about half the loadin Sete eine 
Sa ior a ficto A a whole to be regarded as significant. 
amely, that the product of the two high ings i 
onnie; should exceed the S.E. of zero k: Ù e e ga aactor; REAR 
adhere to his own rule, hence several of th fact i in hi z 
researches are certainly not significant, e factors claimed in his U.S.A.A.F: 
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from them, whereas American primary factors more often all 
possess much the same status and variance. Not only do such 
primary factors seem, from our standpoint, to carry some of the 
variance that would be better attributed to g, but also one or more 
of them (usually a Reasoning factor) may consist wholly of g. 

The main arguments are as follows. N 

(1) Size of G in Unselected Populations. G is so much larger 
than all other factors put together in unse ected populations of 
adults or children that it's psychologically foolish as well as mathé- 
matically difficult to belittle it. It may legitimately be asked why 
representative populations should be taken as a standard, rather 
than selected ones such as college students, among whom it is often 
quite difficult to establish a g. The writer would agree that analyses 
of selected groups like Thurstone’s and Guilford’s are useful for 
bringing up group factors which might otherwise be obscured. 
But Thurstone (1945) himself acknowledges that selection, though 
not affecting the main factor pattern, does distort the sizes of the 
loadings. And he points out that when selection is complex (based 
on several variables) artificial factors may be introduced. In most 
investigations of college students selection is based on a great 
variety of socio-economic, educational and other influences which 
are unlikely to be correlated with any factor other than g. Hence 
the g-variance only is reduced and no great harm is done. But 
Guilford’s pilots were mostly doubly selected, by the U.S.A.A.F. 
Qualifying Examination and by the Aircrew Aptitude battery, 
that is to say by tests which are themselves more closely correlated 
with some of the tests to be factorized than with others. And this 
is likely to have played havoc with the obtained correlations and 
the resulting factor patterns. True Dudek (1948) has tried to show 
that the same factors do emerge in a group selected by the Qualify- 
ing Examination, and in a group ef women pilots, a9 in an un- 
selected group of candidates. But in fact he gets parallel results 
only for five of the most commonly accepted factors; the smaller 
factors differ considerably in these three populations. Moreover, 
none of these groups had been doubly selected as had many of 
those in which Guilford’s more dubious Reasoning, Integration 
and Spatial factors occurred. 

(2) Greater Stability of G and the Major Group Factors. 
Burt has claimed that group-factor solutions are more invariant or 
stable than primary factor ones, that is less liable to vary in differ- 
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ent populations or with alterations in the battery of tests. 
While this might be difficult to prove, it is clear that g, the major 
group factors and some of the minor ones almost always turn up 
in much the same form in any reasonably representative group. 
In contrast American writers have described at least a hundred 
ability factors of one kind or another, which are only partially 
reconcilable with Thurstone’s, though it is true that Thurstone’s 
own results show greater uniformity. Balinsky’s (1941) investiga- 
tion of the Wechsler-Bellevue scale is interesting in this respect. 
In all his age groups there was a clear differentiation between 
verbal and performance tests, but no other consistent factors were 
found in several successive groups. The Division of Occupational 
Analysis also found considerable variations in their nine studies, 
but the most stable factors were V, N and S. Surely then it is more 
logival to stress the major general and group factors which almost 
always turn up and whose statistical significance is indubitable, 
and to admit that the minor ones are so dependent on the parti- 
cular set of tests and the heterogeneity and background of the 
particular populations tested that they do not merit the designation 
‘primary’. 

(3) Quickness of Group and Multiple-Factor Techniques: 
Relative Subjectivity. Group-factor analysis is very much 
quicker. Given the correlations between twenty tests the present 
writer can usually perform the analysis in a day, where a controid 
analysis with rotation would take him a week. Banks (1948) makes 
a similar observation in re-analysing some of Cattell’s personality 
data. Even when a preliminary centroid analysis is made in order 
to indicate objectively what group factors are present, the time 
saved on successive approximation to communalities and ön 
rotation is very considerable, It must be admitted, however, as a 
counter-argument, that there-is rather a large element of sub- 
jective choice in most group factor analyses, even when they are 
guided by centroid results.1 One naturally tends to aim at factor 
patterns which are consonant with those obtained for the same 
tests in previous studies, and may therefore fail to realize that some 
of these preconceptions are wrong. Thurstone would claim that 
rotation to simple structure is objective, i.e. that there is one best 
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solution to each factorial problem which maximizes the number of 
insignificant loadings. Indeed some factorists such as Cattell 
actually carry out their rotations without knowing which test is 
which. But it is obvious that the majority are less strong-minded, 
and that their rotations are made in the light of their judgment of 
the content of the factorized tests. Centroid analysis may therefore 
be as subjective as group factor. 

(4) Divisibility of Primary Factors. Æs would be expected 
on the hierarchical theory, more detailed investigation of parti 
cular primary factors frequently splits them up into smaller com- 
ponents. Examples are provided by Davis’s and Langsam’s studies 
of reading ability, Guilford’s of the spatial factor, Taylor’s and 
Carroll’s studies of V and W, etc. True, Thurstone himstif is 
careful not to claim that his factors are ultimate mental elements, 
but if the term primary is used, it is curious that they should be so 
unstable. When Simple Structure is aimed at, there is no ready 
means of indicating that some factors are more comprehensive than 
others, or can include others. Oblique and second order factors 
could indeed cover this situation, but no American factorist has yet 
systematically analysed a varied battery of tests into several higher 
order factors and sub-factors, along the lines of the group-factor 
analyses in Tables V and IX. 

Unless some such plan is adopted it is difficult to see where 
factorization is to stop. Guilford’s work indicates that almost any 
specific factor (in Spearman’s sense) can be turned into a primary 
factor, given sufficient ingenuity in test construction. The view 
was put forward in Chapter III that highly specialized factors, 
which have no appreciable significance for everyday life, are not 
worth isolating. If this is agreed, it necessitates some distinction 
between the more fundamental factors, which are preferably ex- 
tracted first, and the narrower ones. In other words) factorists 
should aim not merely to reduce large numbers of variables to a 
few components that account for their inter-correlations, but also 
to reduce them to the fewest components which will cover most 
variance. 

(5) No Test Measures a Single Factor. In spite of many 
efforts, no psychologist has been able to devise a test which is truly 
‘univocal’, i.e. which measures only a single primary factor (apart 
from error variance). Guilford and Michael (1948) admit that in 
order to measure a person’s factor score, it is usually necessary to 


K 


134 The Structure of Human Abilities 


add ‘suppressor variables’, that is to subtract weighted scores on 
other tests in order to eliminate the unwanted g or other content. 
Why not admit then that all tests do involve g, instead of arti- 
ficially removing it by means of rotation? ; 

(6) Hierarchy a Statistical Artefact? One argument in the 
opposite direction is that the notion of hierarchy arises merely 
because centroid analysis yields a general factor and a series of 
bipolar factors sub-aividing the tests into smaller groups. This 
nuight be answered by pointing out that group-factor analysis pre- 
ceded multiple factor in psychology. Moreover, in most analyses, 
g differs considerably from the first centroid factor, and group 
factors do not necessarily correspond to the bipolars; any one 
group factor often combines parts of the variance of the first 
factor and of two or more bipolars. Finally, we have admitted 
(p. 25f) that the strict hierarchy of Fig. 1 is an over-simplification. 

(7) Psychological Soundness of Procedures Deriving from 
the Opposed Theories. Perhaps the most important objection 
to a ‘neo-faculty’ viewpoint is that it tends to encourage undesir- 
able practices in educational and vocational guidance. Although 
factorists themselves are well aware of the dangers of the ‘naming 
fallacy’, the users of their tests are not, If testers are told that a test 
is a good measure of the verbal factor, or the memory factor, etc., 
they jump to the conclusion only too easily that such a test will 
predict ability for any job, or type of education, which seems to 
them to involve verbal ability, or memory. The Division of 
Occupational Analysis’s General Aptitude Test Battery provides a 
flagrant instance. Personnel officers will naturally suppose that 
candidates for jobs apparently requiring hand-eye co-ordination 
should score well on the hand-eye co-ordination factor. But in 
actual fact the score for this factor is based purely on two tests of 
drawing lines and making dots accurately, and there is no evidence 
whatever as to the validity of these tests for any job. Previous 
research would suggest that it is extremely low. We do not 
question the great utility of batteries of differential aptitude tests 
such as this one, or Guilford and Zimmerman’s (1948) and others. 
But they should not be published without objective evidence of 
their correlations with job success; and if, as is usually the case, 
the factorial structure of the tests is complex and their loadings 
with practically important factors over and above g and v:ed or 
k:m is small, this should be made clear. 
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This point is much more obvious to the vocational or educational 
psychologist who accepts the hierarchical theory. Moreover, this 
theory considerably simplifies his or her task. Instead of having to 
apply a very lengthy battery such as the G.A.T.B. to all candidates 
for guidance, in order to determine their profiles on ten factors, he 
realizes that a short battery of ved and k:m tests, which can be 
given and scored in one hour, will take him a very long way.* It 
will cover most of the ground in educational dt vocational predic- 
tion that can ever be covered by tests. He can, however, proceed toe 
give tests of further minor factors (perceptual, manual, etc.) or 
tests of a more work-sample type referring to partictilar jobs, in 
order to achieve a gain in accuracy of some 5 to 10 per cent., if this 
appears to be needed in any individual case. Again, the hier- 
archical viewpoint justifies the use, for many purposes, of the I.Q. 
or of comparable measures of adult intelligence, that is of g or 
g +v tests alone. To the multiple factorist the I.Q. is meaning- 
less compound, and Thurstone, Guilford and others strongly 
advocate the substitution of tests of half a dozen, or a dozen or 
more, factors. But in spite of their criticisms, the I.Q. is found as 
useful and convenient by American practising psychologists as by 
British ones. Even if they do not realize it, such psychologists are 
thereby committed to a hierarchical rather than a multiple-factor 
viewpoint. And they are less likely to be led astray by their present 
procedures than they would be by ambitious schemes for differen- 
tial testing of all the main factors. 


1 Note that the combined score on such a battery will provide a measure of g, 
without the need of any tests of g alone. Similarly it follows from the hierarchical 
theory that scores on tests of minor group factors will add up to give a measure 
of the major group factor from which they derive. 

2 Similarly it justifies the use of general reading attainment tests instead of 
separate tests for different reading factors which Davis and others advocate. 
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